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Introduction 


The  Research  Laboratory  of  Electronics 

The  Research  Laboratory  of  Electronics  (RLE)  was  established  in  1946  as  the  Institute’s 
first  interdepartmental  laboratory.  Originally  organized  under  the  joint  sponsorship  of  the 
Departments  of  Physics  and  Electrical  Engineering,  RLE  has  broadened  its  interests  to 
cover  a  wide  range  of  research. 

The  RLE  environment  provides  both  the  freedom  of  action  essential  in  an  academic  insti¬ 
tution  and  the  availability  of  large-scale  laboratory  facilities  and  services  required  by 
researchers.  RLE’s  interdisciplinary  setting  offers  many  opportunities  for  creative  and 
collaborative  research.  By  fostering  this  powerful  combination  of  research  and  education, 
RLE  effectively  penetrates  beyond  the  horizon  of  new  ideas  and  information. 


RLE  Progress  Report 

RLE  Progress  Report  Number  135  describes  research  programs  at  RLE  for  the  period 
January  1  through  December  31,  1992.  Each  chapter  of  the  Progress  flepo/t  contains 
both  a  statement  of  research  objectives  and  a  summary  of  research  efforts  for  research 
projects  listed.  Faculty,  research  staff,  students  and  others  who  participated  in  these 
projects  are  identified  at  the  beginning  of  each  project,  along  with  sources  of  funding. 

There  are  four  appendices  at  the  end  of  the  report:  Appendix  A  is  a  bibliography  of  RLE 
publications  and  papers  presented  by  RLE  staff  during  1992;  Appendix  B  is  a  roster  of 
current  RLE  staff;  Appendix  C  is  a  list  of  RLE  faculty  and  staff  milestones  and  honors 
received  during  1992;  and  Appendix  D  is  an  index  of  RLE  sponsors.  In  addition,  the 
Project  Staff  and  Subject  Index  provides  access  to  the  information  in  this  report. 

RLE  Progress  Report  Number  135  was  produced  by  the  RLE  Communications  Office. 
Further  inquiries  may  be  addressed  to: 


Research  Laboratory  of  Electronics 
Room  36-41 2 

Massachusetts  Institute  of  Technology 
Cambridge,  Massachusetts  02139-4307 
Tel.  (617)  253-2566 
Fax  (617)258-7864 
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Chapter  1.  Heterostructures  for  High  Performance  Devices 
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1.1  Introduction 

The  broad  objective  of  our  research  effort  is  to 
develop  lll-V  quantum  heterostructures  for  high  per¬ 
formance  electronic,  optoelectronic,  and  photonic 
devices  for  applications  in  high  speed  optical  com¬ 
munications  and  signal  processing.  To  this  end,  we 
are  developing:  (1)  new,  higher  performance  mate¬ 
rials  systems  including  InP-based  InGaAIAs  heter¬ 
ostructures  and  <11 1>  oriented  strained  layer 
superlattices;  (2)  novel  approaches  to  integrate 
laser  diodes  on  VLSI-level  electronic  integrated  cir¬ 
cuits;  (3)  a  new  family  of  quantum-well-base, 
tunnel-barrier  n-n-n  transistors  and  near-  and  far- 
infrared  optoelectronic  devices;  and  (4)  new 
damage-free  in  situ  processing  techniques  for  fabri¬ 
cating  advanced  quantum  structure  and  embedded 
heterostructures. 

The  following  sections  describe  our  progress  during 
the  past  year  in  the  above  research  areas.  Our 
group  works  closely  with  Professors  Hermann  A. 
Haus,  Erich  P.  Ippen,  and  James  G.  Fujimoto  to 
develop  the  optical  device  application,  characteriza¬ 
tion,  and  modeling  aspects  of  this  program,  with 
Professor  Henry  smith  to  develop  a  novel  distrib¬ 
uted  feedback  laser  structure,  and  with  Professor 
Sylvia  T.  Ceyer  to  develop  new  in  situ  processing 
techniques. 


1.2  MBE-Grown  InGaAIAs  Laser  Diodes 
for  Optical  Fiber  Communication 
Applications 

Sponsors 

DARPA/NCIPT 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 

Project  Staff 

Woo-Young  Choi,  Professor  Clifton  G.  Fonstad,  Jr. 

Semiconductor  laser  diodes  with  the  lasing  wave¬ 
length  of  1 .3  to  1 .5  pm  are  an  essential  element  of 
long-distance  optical  fiber  communication  systems 
that  have  revolutionized  modern  telecommunication 
networks.  The  material  system  of  InGaAsP  is  most 
often  used  for  laser  diodes  for  such  applications. 
We  are  investigation  the  feasibility  of  alternate 
material  system  of  InGaAIAs  for  the  same  applica¬ 
tion. 

The  main  advantage  of  InGaAIAs  over  InGaAsP  is 
that  quantum  wells  based  on  InGaAIAs  have  larger 
conduction  band  and  smaller  valence  band  offsets. 
For  high  speed  modulation  the  hole  transport  can 
be  a  limiting  factor;  over-confinement  of  holes  in 
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InGaAsP  quantum  wells  is  undesirable.  The  other 
advantage  of  this  material  is  that,  unlike  InGaAsP, 
InGaAIAs  can  be  successfully  grown  by  solid 
source  MBE  without  toxic  gases.  This  "environ¬ 
mental"  factor  will  undoubtably  become  increasingly 
important  as  the  price  of  disposing  of  environment 
pollutants  grows. 

The  advantages  listed  above  are  not  realized 
without  a  few  challenges.  First,  the  high  per¬ 
centage  of  aluminum  in  the  device  structure  can 
trap  impurities  that  can  reduce  the  optical  quality  of 
the  resulting  materials.  To  reduce  this  posibility  as 
much  as  possible,  we  have  investigated  the  optical 
quality  of  materials  obtained  under  various  growth 
conditions  and  have  determined  optimal  growth 
conditions  for  each  device  structure  desired. 
Although  we  plan  to  do  more  growth-related 
studies,  so  far  we  have  demonstrated  device  perfor¬ 
mances  of  InGaAIAs  quantum  well  laser  diodes  that 
are  comparable  to  those  produced  with  InGaAsP. 

Second,  it  is  essential  that  laser  devices  for  long¬ 
distance  optical  fiber  communication  applications 
operate  with  single  longitudinal  modes.  Since 
InGaAIAs  is  not  suitable  for  regrowth,  techniques 
that  are  commonly  used  to  achieve  single  mode 
selection  cannot  be  applied  to  MBE-grown 
InGaAIAs.  We  are  developing  a  novel  ridge-grating 
distributed  feedback  (DFB)  process  in  which  grat¬ 
ings  are  made  on  the  side  walls  of  ridges,  removing 
the  need  for  regrowth  (see  following  section).  For 
this  process,  we  are  collaborating  with  Professor 
Henry  I.  Smith's  group  at  MIT  since  the  x-ray  lithog¬ 
raphy  technique  he  and  his  colleagues  have  devel¬ 
oped  are  particular  applicable  for  this  process. 

In  addition  to  the  above  issues,  we  are  also  investi¬ 
gating  the  optimal  use  of  strained  multiple-quantum 
wells  for  the  laser  diode  applications  and  reliable 
ridge  waveguide  processing  techniques. 


1 .3  Design  and  Fabrication  of 
Distributed  Feedback  (DFB)  InGaAIAs 
Laser  Diodes  Grown  by  Molecular  Beam 
Epitaxy 

Sponsors 

DARPA/NCIPT 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 

Project  Staff 

Woo-Young  Choi,  Professor  Clifton  G.  Fonstad,  Jr. 
in  collaboration  with  Professor  Henry  I.  Smith 


it  is  essential  that  laser  diodes  for  optical  fiber  com¬ 
munications  applications  operate  with  a  single  oscil¬ 
lation  frequency.  Typically,  schemes  of  distributed 
feedback  (DFB)  or  distributed  Bragg  reflector  (DBR) 
are  used  in  which  gratings  above  or  below  the 
active  layer  (DFB)  or  outside  the  active  layer  (DBR) 
perform  the  act  of  frequency  selection.  To  make 
such  a  device  structure,  either  epitaxial  growth  is 
initiated  on  a  corrugated  substrate  or  regrowth  is 
performed  on  the  epitaxial  material  onto  which  grat¬ 
ings  have  been  formed. 

In  this  project,  we  are  pursuing  a  DFB  device  struc¬ 
ture  in  which  gratings  are  formed  entirely  after  com¬ 
plete  epitaxial  growth.  Our  motivation  for  this 
approach  stems  from  the  fact  that  we  are  working 
on  laser  diodes  grown  by  conventional  solid-source 
molecular  beam  epitaxy  (MBE)  technology.  Ddue 
to  its  growth  kinetics,  successful  MBE  growth  on 
the  corrugated  surface  is  very  difficult.  The  present 
structure  is  also  more  suitable  for  use  with  the 
InGaAIAs  system  (see  preceeding  section). 

We  have  proposed  a  ridge  stripe  structure  in  which 
gratings  are  made  on  the  side  walls  of  the  ridge  as 
well  as  on  the  bottom  channels  next  to  the  ridge. 
According  to  our  initial  calculation,  there  is  enough 
coupling  between  the  optical  wave  and  gratings  to 
result  in  the  single  mode  selection. 

To  realize  such  a  device  structure,  we  are  collab¬ 
orating  with  Professor  Smith’s  group  and  utilizing  its 
x-ray  lithography  technology.  Currently,  we  are  in 
the  process  of  combining  x-ray  lithography  tech¬ 
nology  with  ridge-strip  laser  diode  fabrication  tech¬ 
nology. 


1.4  Feasibility  Study  of  1.55 
Intersubband  Transition  in  InGaAs/AIAs 
Quantum-Well  Heterostructures 

Sponsors 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 
National  Science  Foundation 
Toshiba  Corporation  Ltd. 

Project  Staff 

Yuzo  Hirayama,  Jurgen  H.  Smet,  Professor  Clifton 
G.  Fonstad,  Jr.  in  collaboration  with  Professor 
Erich  P.  Ippen  and  Professor  Qing  Hu 

Realization  of  a  1.55  pm  intersubband  transition 
using  InGaAs/AIAs  quantum  wells  is  an  interesting 
topic  for  basic  physics  as  well  as  for  its  potential 
applications.  Very  fast  relaxation  times  and  large 
optical  nonlinearlities,  which  are  useful  for 
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advanced  photonic  devices  such  as  a  high  speed 
photonic  switch,  are  expected.  A  wavelength  of 
1.55  (xm  is  most  suitable  for  optical  communication 
systems.  We  plan  to  study  carrier  dynamics  in 
these  quantum  well  structures  using  femtosecond 
pump  and  probe  techniques. 

Structures  in  which  the  energy  separation  between 
the  first  and  fourth  subband  corresponds  to  a  wave¬ 
length  of  1 .55  pm  were  designed  and  fabricated  on 
InP  substrates  by  MBE  techniques.  Strain  compen¬ 
sated  structures  were  also  fabricated.  To  reveal 
subband  structures  in  these  quantum  well  samples, 
a  conventional  optical  absorption  measurement 
system  and  a  newly  developed  photomodulated 
transmission  measurement  system  were  built.  The 
latter  system  is  basically  a  modulation  spectroscopy 
method,  having  a  high  signal-to-noise  ratio.  There¬ 
fore,  it  is  suitable  for  measurements  of  very  narrow 
single  quantum  well  structures. 

A  careful  study  of  InGaAs/AIAs  quantum  welt 
samples  revealed  the  following  facts:  (1)  Samples 
show  many  unidentified  optical  signals  unless  thick 
InGaAs  or  InGaAIAs  buffer  layers  are  used.  (2)  Dis¬ 
locations  due  to  strain  relaxation  are  observed 
unless  growth  is  performed  at  low-substrate  temper¬ 
ature.  (3)  Transitions  from  the  first  to  the  second 
subband  were  observed.  However,  transitions  from 
the  first  to  the  fourth  subband  were  too  weak  to  be 
detected.  (4)  Peak  positions  in  the  transmission 
spectra  are  difficult  to  predict  so  that  simple  effec¬ 
tive  mass  approximation  cannot  be  applied.  A 
modified  theory  including  other  effects  such  as 
band  bending  or  electron-electron  exchange  inter¬ 
actions  cannot  account  for  the  results. 

To  realize  a  1.55  pm  intersubband  transition,  we 
have  redesigned  our  quantum  well  structure  using 
our  previous  experimental  data.  First  observation  of 
1.55  pm  intersubband  transition  will  be  in  the  near 
future.  Theoretical  work  is  also  on  going.  To 
measure  time  resolved  optical  spectra,  a  new  pump 
and  probe  system  using  white  probe  light  is  being 
built.  In  this  system,  strong  optical  pulses  which 
are  amplified  in  a  NaCI  color  center  crystal  go  into 
a  nonlinear  material  and  generate  white  light.  It  is 
expected  that,  with  the  use  of  this  setup,  the  carrier 
dynamics  in  these  new  quantum  well  structures  will 
become  clear. 


1.5  Growth  and  Characterization  of 
High  Quaiity  inGaAIAs  Muitiple 
Quantum  Weils 

Sponsor 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 

Project  Staff 

Woo-Young  Choi,  Yuzo  Hirayama,  Professor  Clifton 
G.  Fonstad,  Jr. 

Use  of  strained  layers  in  fieterostructure  device 
research  is  one  of  the  most  significant  develop¬ 
ments  in  the  past  decade.  Strained  layers  are  now 
routinely  utlitized  for  enhancing  the  performance  of 
optical  and  electrical  devices,  yielding  lower 
threshold  currents  and  faster  modulation  speeds  for 
quantum  well  laser  diodes  and  higher  electron 
mobilities  for  heterostructure  FET  devices. 
However,  this  enhanced  performance  can  be 
obtained  only  if  the  strained  layers  do  not  have  any 
dislocation  caused  by  strain  relaxation.  Conse¬ 
quently,  understanding  the  strain  relaxation  mech¬ 
anism  and  a  careful  control  of  epitaxial  growth 
parameters  are  required  for  successful  utilization  of 
strained  layers  in  heterostructure  devices.  With  this 
in  mind,  we  have  performed  intensive  studies  in 
which  MBE-grown  InGaAIAs  strained  multiple- 
quantum-wells  on  InP  are  characterized  by  double 
crystal  x-ray  diffraction,  transmission,  and  photolum¬ 
inescence  measurements.  The  strained  quantum 
wells  under  study  have  great  potential  for  applica¬ 
tions  in  laser  diode  and  modulator  devices. 

Because  the  wavelength  of  x-rays  is  shorter  than 
the  lattice  constants  of  materials  being  studied  and 
the  high  beam  quality  obtained  in  the  double  crystal 
configuration,  double  crystal  x-ray  diffraction  is  the 
best  characterization  method  for  obtaining  structural 
information  of  strained  quantum  wells.  The  degree 
of  strain  relaxation  can  be  qualitatively  determined 
by  the  sharpness  of  satellite  peaks  caused  by  the 
well  and  barrier  periodicity.  For  coherent  strained 
layers  without  major  strain  relaxation,  we  have 
developed  a  systematic  method  of  determining 
layer  composition  and  thickness  in  strained  multiple 
quantum  wells  based  on  the  amount  of  satellite 
peak  separation  and  other  growth  information. 

From  the  room  temperature  transmission  measure¬ 
ments  on  strained  multiple-quantum-well  samples, 
we  routinely  observe  excitonic  transitions,  indicating 
the  high  quality  of  strained  quantum  wells  that  are 
achieved  in  our  lab.  In  addition,  from  the  careful 
analysis  of  the  spectra,  we  have  estimated  impor¬ 
tant  parameters  such  as  exciton  binding  energy  and 
radius,  in-plane  reduced  effective  mass,  etc. 
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From  careful  analyses  of  photoluminescence 
spectra,  we  have  determined  qualitative  differences 
in  the  nature  of  luminescence  peaks  that  are  due  to 
strain-relaxed  quantum  wells,  bound  or  free  exc- 
itons,  and  free  carriers.  These  identifications  and 
their  characteristic  dependence  on  temperature 
have  enabled  us  to  correctly  interpret  the  photolum¬ 
inescence  spectra. 

These  characterization  methods  are  exciting 
research  techniques  from  which  insight  into  the 
physics  of  strained  multiple  quantum  wells  can  be 
obtained.  Furthermore,  they  allow  us  to  make  an 
easy  and  early  determination  of  qualify  of  strained 
multiple  quantum  wells  in  actual  device  structures 
and  provide  material  parameters  that  are  of  great 
importance  in  designing  and  analyzing  devices. 

1.6  New  Three-Terminal  Laser  Diodes 
with  Dynamic  Control  of  Gain  and 
Refractive  Index 

Sponsor 

DARPA/NCIPT 

Project  Staff 

Paul  S.  Martin,  Professor  Clifton  G.  Fonstad,  Jr.  in 
collaboration  with  Professor  Hermann  A.  Haus 

We  are  investigating  a  new  class  of  devices  in 
which  the  active  region  consists  of  two  quantum 
wells  with  different  widths  and  therefore  different 
optical  gain  profiles.  We  call  these  devices  indepen¬ 
dently  addressable  asymmetric  quantum  well  laser 
diodes  (lAADQW  -  LDs).  By  designing  structures  in 
which  the  current  injection  into  these  two  quantum 
wells  can  be  independently  controlled,  we  are 
gaining  a  new,  as  yet  unutilized,  degree  of  freedom 
in  controlling  light  output  from  the  device. 

Wavelength  division  multiplexing  (WDM)  is  an  area 
in  which  we  believe  this  new  class  of  devices  can 
have  an  important  impact.  An  alternative  approach 
to  a  single  very  fast  laser  diode  is  to  use  many 
slower,  and  thus  simpler,  less  expensive  and  more 
reliable  laser  diodes  all  transmitting  down  the  same 
fiber  but  at  slightly  different  wavelengths.  Since  the 
low  loss  region  of  currently  installed  silica  fiber  is 
very  broad,  this  technique  has  the  potential  to  dra¬ 
matically  increase  single  fiber  data  rates  into  the 
terabit  per  second  range  for  the  installed  fiber  base. 


Essential  for  realizing  practical  WDM  systems  are 
dynamically  tunable  laser  diodes  for  adjusting  the 
wavelength  of  each  signal  channel  and  narrow  line- 
width  channel.  Simulations  we  have  done  show 
that  InP-based  lAADQW-LDs  can  have  tuning 
ranges  in  excess  of  20  nm  (2  THz  )  within  the  1.5 
urn  low-loss  window  of  Silica  fiber.  If  realized,  this 
would  be  a  significant  improvement  over  current 
multisection  distributed  Bragg  reflector  (DBR)  type 
lasers  which  typically  have  usable  tuning  ranges  of 
only  a  few  nm.  Also  important  for  practical  applica¬ 
tions  is  the  change  in  output  power  as  the  LD  is 
tuned.  Since  the  wavelength  of  the  lAADQW  is 
determined  only  by  the  ratio  of  the  currents  injected 
into  the  two  active  regions,  the  magnitude  of  current 
injection  can  be  used  to  control  power  output, 
making  it  possible  to  keep  power  output  constant 
over  the  entire  tuning  range.  The  ability  to  elec¬ 
trically  tune  output  wavelength  over  a  broad  range 
while  keeping  output  power  constant  makes  the 
lAADQW  device  ideal  for  WDM. 

To  implement  the  lAADQW-LD  we  have  developed 
a  self-aligned  process  that  uses  a  single 
metalization  layer  to  define  the  ridge  waveguide 
through  SiCU-based  reactive  ion  etching  (RIE). 
This  layer  defines  the  current  injection  region  for 
the  lower  quantum  well  through  H+  ion  implantation 
and  contacts  the  upper  quatum  well.  One  subse¬ 
quent  photolithography  step  and  metalization  will 
then  complete  processing  for  the  device. 

Other  applications  for  these  devices  include  pure 
FM  laser  diodes  with  reduced  AM  noise,  two- 
wavelength  integrated  but  independently  controlled 
laser  diodes,  and  other  non-laser  devices  like 
tunable  narrow  bandwith  filters  and  light  modula¬ 
tors. 


1.7  Laser  Diode  Modeling  and  Design 
for  Narrow  Linewidth  Operation 

Sponsor 

Charles  S.  Draper  Laboratories 

Project  Staff 

Yakov  Royter,  Professor  Clifton  G.  Fonstad,  Jr.  in 
collaboration  with  J.H.  Hopps® 

Many  of  the  current  laser  applications  require 
narrow  linewidth  output  characteristics.  Unfortu¬ 
nately,  semiconductor  lasers,  possessing  such 
advantages  as  small  size,  direct  modulation  capa- 
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bility,  and  integrability  with  other  optical  and  elec¬ 
tronic  devices,  have  broad  linewidths  in  comparison 
to  almost  all  other  kinds  of  lasers.  The  relatively 
broad  linewidths  are  due  to  the  low  Q  of  the  cavity 
and  to  the  intrinsic  effects  specific  to  the  semicon¬ 
ductor  gain  medium.  Since  most  of  the  linewidth 
reduction  methods  have  involved  the  increase  of 
the  cavity  Q,  our  goal  was  to  investigate  and  help  in 
the  reduction  of  intrinsic  linewidth  broadening 
effects.  Reduction  of  these  effects,  in  conjunction 
with  the  increase  of  the  cavity  Q,  would  produce 
semiconductor  lasers  with  narrowest  possible  line- 
widths. 

The  first  step  in  our  research  was  to  investigate 
theoretically  the  intrinsic  linewidth  broadening 
effects.  In  particular,  we  concentrate  on  the  alpha 
parameter,  the  largest  intrinsic  linewidth  broadening 
factor,  which  describes  the  coupling  of  gain  and 
refractive  index  fluctuations  in  the  gain  medium  via 
carrier  density  fluctuations.  For  this  purpose,  we 
calculated  the  carrier  energy  bands  of  the  active 
region  semiconductor  material,  using  the  k  •  p 
approximation.  From  the  energy  bands,  we 
obtained  gain  and  refractive  index  profiles  and  line- 
width  broadening  parameter. 

To  test  our  models,  we  plan  to  compare  the  calcu¬ 
lated  laser  characteristics,  such  as  threshold 
current,  gain,  refractive  index  and  linewidth,  with 
experimental  results  obtained  from  measurements 
done  in  our  laboratory,  as  well  as  those  quoted  in 
the  literature.  Measurements  of  the  CW  semicon¬ 
ductor  laser  subthreshold  spectrum  are  being  con¬ 
ducted  from  which  gain  profile  can  be  deduced.  The 
shifts  in  the  mode  spectrum  due  to  the  change  in 
the  pumping  provide  information  on  the  gain- 
refractive-index  coupling,  and  thus  the  alpha  param¬ 
eter.  To  reduce  the  effect  of  thermal  variations,  gain 
measurements  of  pulsed  laser  spectra  will  also  be 
done.  Finally,  from  the  comparison  between  exper¬ 
imental  and  theoretical  results,  we  will  attempt  to 
establish  systematic  ways  of  obtaining  parameters 
for  our  semiconductor  laser  models. 

As  an  extension  of  our  modeling  efforts,  we  will 
also  investigate  strain  layer  quantum  well  lasers. 
Furthermore,  we  intend  to  improve  our  models  by 
including  calculations  of  energy  dependent  carrier 
lifetimes,  different  loss  mechanisms,  and  other 
sources  of  linewidth  broadening. 


1.8  Evaluation  of  GaAs  MESFET  VLSI 
Circuits  as  Substrates  for  iil-V 
Heterostructure  Epitaxy 

Sponsors 

DARPA/NCIPT 

Hertz  Foundation  Fellowship 

Vitesse  Semiconductor 

Project  Staff 

Krishna  V.  Shenoy,  Professor  Clifton  G.  Fonstad, 
Jr.  in  collaboration  with  J.  Mikkelson^ 

Commercially  available  VLSI  GaAs  metal  semicon¬ 
ductor  field  effect  transistors  (MESFETs)  are  poten¬ 
tially  very  useful  in  optoelectronic  integrated  circuits 
(QEICs).  Direct  epitaxy  of  photonic  devices  on  fully 
processed  MESFET  circuitry  is  a  viable  approach  to 
realizing  monolithic  OEICs  if  MESFETs  are 
thermally  stable  at  elevated  temperatures.  Ohmic 
contact  and  Schottky  gate  degradation,  caused  by 
thermally  induced  interdiffusion  to  the  source  or 
drain  and  channel,  respectively,  are  the  major 
modes  of  failure  in  GaAs  MESFETs  with  Au 
metallization.  In  this  research,  we  are  examining 
the  performance  of  commercial  refractory-metal 
VLSI  GaAs  MESFETs  after  3  h,  high-temperature 
anneals,  which  duplicate  the  molecular  beam 
epitaxy  (MBE)  growth  sequence  for  an  in-plane 
surface-emitting  laser  (IPSEL). 

Vitesse  Semiconductor  Corporation  HGaAs2  tech¬ 
nology,  a  self-aligned  VLSI  GaAs  MESFET  process 
with  tungsten-based  refractory-metal  Schottky 
gates,  nickel-based  refractory-metal  ohmic  contacts, 
and  aluminum  interconnection  metallization,  was 
selected  for  this  study.  Enhancement  mode 
MESFETs  (EFETs),  depletion  mode  MESFETs 
(DFETs),  and  transmission  line  model  (TLM)  struc¬ 
tures  were  selected  from  standard  production 
process  control  monitor  test  bars.  Partially  pro¬ 
cessed  wafers  (with  neither  aluminum  intercon¬ 
nection  nor  passivation  dielectric  deposited),  as  well 
as  fully  processed  wafers,  were  tested. 

MESFET  and  TLM  parameter  trends  as  the 
maximum  anneal  temperature  was  increased  from 
AQO^C  to  600‘’C  are  as  follows:  (1)  The  partially 
processed  wafer  trends  are  similar  to  the  fully  pro¬ 
cessed  wafer  trends  but  are  less  pronounced.  (2) 
EFET  and  DFET  parameter  trends  are  in  agree¬ 
ment.  (3)  Vt,  Schottky  diode  ideality,  Schottky 
barrier  voltage,  and  Rsh  show  no  significant  trends 
while  lif  and  gm  decrease,  and  Rgs  increases.  (4) 
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Qm  and  Ids  decrease  significantly  above  525°C  * 
10°C:  Rqs  increases  significantly  above  525°C  ± 
10°C.  Most  significantly,  Schottky  gate  and  ohmic 
contact  resistance  degradation  modes  are  absent. 
The  increase  in  Rqs,  which  is  the  sum  of  the 
Schottky  gate  resistance,  channel  resistance, 
source  implant  sheet  resistance,  and  ohmic  contact 
resistance,  can  be  attributed  to  an  increase  in  the 
channel  resistance  component,  and  is  responsible 
for  the  apparent  decreases  in  g^  and  Ids- 

These  results  indicate  that  if  we  can  limit  the  tem¬ 
perature  during  the  major  portion  of  a  growth 
sequence  to  525°C  or  below,  we  will  be  able  to 
grow  heterostructures  on  pre-processed  GaAs 
integrated  circuits  without  effecting  the  performance 
of  the  underlying  circuitry,  and  therefore  not 
requiring  any  modification  of  exi'^ting  design  rules 
and  circuit  simulation  models. 


1.9  Low  Temperature  Growth  of 
GaAIAs  Laser  Diodes 

Sponsors 

DARPA/NCIPT 

GTt  Laboratories 

Hertz  Foundation  Fellowship 

National  Science  Foundation  Fellowship 

Vitesse  Semiconductor 

Project  Staff 

Krishna  V.  Shenoy,  Professor  Clifton  G.  Fonstad, 
Jr.  in  collaboration  with  J.  Mikkelson®  and  B. 
Elman^ 

Direct  epitaxy  of  photonic  devices  on  fully  pro¬ 
cessed  GaAs  VLSI  circuits  to  produce  OEICc  will 
require  that  lasers  are  grown  at  525°C  or  le&o.  The 
effect  of  lowered  temperature  AIGaAs  epitaxial 
quality  on  laser  performance  is  currently  under 
investigation.  Because  conventional  AIGaAs  is 
grown  at  700°C,  which  violates  the  thermal  con¬ 
straint  imposed  by  the  electronic  circuitry. 

To  investigate  lowered  molecular  beam  epitaxy 
(MBE)  growth  temperatures  on  laser  performance, 
single  quantum  well,  step-confinement,  strained- 
layer  {AI,Ga,ln)As  laser  diode  heterostructures  were 
optimised  for  growth  at  low  temperatures.  Low  Al 
mole  fraction  cladding  layers  (1.4|im  of 
Alo  22Gao  78As)  were  used  to  minimize  oxygen  incor¬ 
poration  at  heterojunction  interfaces  and  allowed 
ridge  waveguide  lasers  to  be  fabricated  using  a 


selective  succinic  acid  etch  and  an  AlAs  etch-stop 
layer.  200  nm  GaAs  waveguide  layers  were  used 
to  maximize  the  optical  confinement  factor  and  opti¬ 
mize  the  quantum  well-to-AlGaAs  cladding  layer 
separation  which  is  critical  with  lowered  temper¬ 
ature  AIGaAs. 

A  60  lno2Gao8As  active  region  was  used  to  mini¬ 
mize  the  threshold  current;  the  corresponding  emis¬ 
sion  is  nominally  at  0.98  ^^m.  A  strained  layer 
quantum  well  active  region  yields  lower  threshold 
currents  than  unstrained  quantum  wells  because 
compressively  straining  InGaAs  decreases  the 
effective  mass  in  the  parallel  direction,  thus 
reducing  the  density  of  states  and  number  of  car¬ 
riers  required  to  reach  population  inversion.  Such 
laser  diodes  were  MBE  grown  with  a  maximum 
substrate  temperature  of  530°C  and  had  threshold 
current  densities  only  2.4  times  higher  than  lasers 
grown  under  more  optimal  growth  conditions.  This 
demonstrates  for  the  first  time  that  practical  laser 
diodes  can  be  grown  using  standard  MBE  tech¬ 
niques  at  temperatures  compatible  with  direct  epi¬ 
taxial  growth  on  fully  processed  VLSI  GaAs 
circuitry. 

Further  reduction  in  threshold  currents  are  antic¬ 
ipated.  More  exploratory  growth  techniques,  such 
as  migration  enlianced  epitaxy  (MEE),  variable  duty 
cycle  As  cell  shuttering,  and  precisely  controlled  low 
As  overpressure,  are  all  being  pursued  to  achieve 
high  quality,  low  temperature  ( <  600" C)  AIGaAs. 
Phosphide  based  optoelectronic  devices,  grown  by 
gas-source  MBE,  are  also  being  considered 
because  they  provide  both  lower  growth  temper¬ 
atures  (typically  <  500°C)  and  shorter  wavelength 
emission. 


1.10  Optical  Input  and  Output  Circuitry 
for  High  Density,  High  Speed  GaAs 
MESFET-based  OEICs 

Sponsors 

DARPA/MOSIS 
DARPA/NCIPT 
Hertz  Foundation  Fellowship 

Project  Staff 

Krishna  V.  Shenoy,  Professor  Clifton  G.  Fonstad, 
Jr.  in  collaboration  with  P.  Nuytkens 

Optoelectronic  signal  processors  have  become 
major  components  of  advanced  communication 
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systems,  real-time  signal  processors,  and  high- 
performance  sensing  systems.  Integrated  optics 
offers  the  potential  to  significantly  advance  these 
systems.  To  fully  exploit  the  potential  of  optoelec¬ 
tronic  devices,  semiconductor  processes  and  tech¬ 
nology  must  be  designed  to  integrate 
photodetectors,  transimpedance  amplifiers,  laser 
diodes,  and  VLSI  transistor  circuits.  Such 
integrated  optoelectronic  device  technology  results 
in  lower  cost,  reduced  power  consumption,  higher 
bandwidth,  lower  weight,  increased  reliability  and 
increased  performance  of  optoelectronic  systems. 

An  optoelectronic  integrated  circuit  (OEIC) 
transceiver  has  been  designed,  thoroughly  simu¬ 
lated  with  HSPICE,  and  fabricated  in  the  Vitesse 
HGaAs2  technology  (MOSIS).  The  OEIC 
transceiver  accepts  either  electrical  or  optical  signal 
inputs  and  generates  corresponding  electrical  or 
optical  signal  outputs.  The  OEIC  incorporates  an 
ECL  electrical  input  receiver  and  output  driver. 
MSM  photodetector,  with  a  companion 
transipedance  amplifier,  laser  diode  driver,  and 
opening  to  the  substrate  for  epitaxial  growth,  cross 
point  switch,  and  control  logic.  The  OEIC 
transceiver  demonstrates  four  possible  signal  con¬ 
versions:  (1)  electrical  signal  at  ECL  level  input, 
through  intermediate  DCFL  electrical  level,  to  ECL 
level  output;  (2)  electrical  signal  at  ECL  level  input 
through  intermediate  DCFL  electrical  level  to  optical 
signal  output  (via  regrown  in-plane  surface  emitting 
laser  diode);  (3)  optical  signal  input  through  inter¬ 
mediate  DCFL  electrical  level  to  electrical  signal  at 
ECL  level  output;  and  (4)  optical  signal  input 
through  intermediate  DCFL  electrical  level  to  optical 
signal  output. 

Ninety-six  processed  chips  5  mm  by  5  mm  on  a 
side,  each  with  24  OEIC  tranceiver  cells,  have  been 
received  through  the  MOSIS  service.  Each  chip 
contains  numerous  lOOum  by  400nm  opennings 
etched  through  to  the  substrate  into  which  laser 
diode  heterostructures  are  now  being  grown. 


1.11  High  Density  GaAs  MESFET-based 
OEIC  Neural  Systems 

Sponsors 

Hertz  Foundation  Fellowship 
National  Science  Foundation  Fellowship 


Project  Staff 

Krishna  V.  Shenoy,  Professor  Clifton  G.  Fonstad, 
Jr.  in  collaboration  with  A.  Grot®  and  D.  Psaltis® 

Two  components  required  implementation  of  a 
neural  network  are  neurons  and  connections.  In  an 
optical  implementation,  neurons  are  typically 
arranged  as  two-dimensional  arrays  interconnected 
via  the  third  dimension.  The  interconnections  are 
realized  with  holograms  or  spatial  light  modulators. 
Use  of  GaAs  optoelectronic  circuits  provides  the 
flexibility  of  implementing  complex  neuron  response 
functions  and  fine  tuning  properties  of  the  neurons 
as  required  by  the  neural  network  algorithm  that  is 
being  implemented. 

LEDs  and  laser  diodes  are  the  choices  for  on-chip 
light  sources.  Laser  diodes  have  higher  quantum 
efficiency  and  a  more  directed  beam  than  LEDs, 
which  means  a  higher  light  efficiency.  Unfortu¬ 
nately,  electrical  power  dissipation  is  a  limiting 
factor  for  high  density  circuits.  For  tyoical  laser 
diode  values  of  Vd  =  2  Volts  and  l,h  =  500^lA  and  a 
maximum  power  dissipation  of  1  W/cm^,  the 
maximum  integration  density  is  1000/cm=.  Because 
of  the  absence  of  a  threshold  current,  LEDs  can 
operate  with  very  small  currents  allowing  a  density 
of  up  to  lOVcm^.  Thus,  if  one  is  interested  in  high 
density  arrays  that  operate  with  relatively  slow 
switching  times  then  LEDs  are  the  preferred  choice. 

A  neuron  calcub^cs  a  simple  nonlinear  function, 
thereby  mapping  its  input  signals  to  the  output 
signal.  A  commonly  used  nonlinearity  is  the  "bump" 
function,  simply  the  derivative  of  a  sigmoidal  func¬ 
tion,  which  is  used  in  backward  error  propagation 
algorithms  for  calculation  of  the  interconnection 
weight  updates.  In  the  most  simplistic  approach  to 
implementing  the  "bump"  function,  the  output 
should  be  low  whenever  the  two  inputs  are  not 
equal  and  high  when  the  two  inputs  are  matched. 
In  this  realization,  the  input  is  a  photodetector 
current,  which  is  converted  to  a  voltage  in  a  differ¬ 
ential  input  circuit,  and  the  output  is  a  current 
through  an  LED.  This  differential  circuit  acts  as  a 
thresholding  circuit  if  the  detectors  have  l-V  charac¬ 
teristics  where  the  current  saturates  quickly.  An 
optoelectronic  integrated  circuit  (OEIC)  neural  chip 
with  arrays  of  GaAs  MESFET  neurons  and 
photodetectors  has  been  designed,  simulated,  fabri¬ 
cated  by  Vitesse,  and  tested  by  Grot  and  Psaltis  at 
the  California  Institute  of  Technology.  Monolithic 
integration  of  LEDs  in  dielectric  windows  is  the  final 
fabrication  step. 
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The  Vitesse  refractory-metal  GaAs  MESFET  circu¬ 
itry  can  withstand  three  hour  molecular  beam 
epitaxy  (MBE)  growths  at  temperatures  up  to  525°C 
±  10°C.  GaAs-AIGaAs  LEDs  are  being  grown  by 
MBE  at  MIT  on  fully  processed  neural  chips  in 
dielectric  windows,  regions  where  the  interlevel  and 
passivation  dielectrics  have  been  removed  to 
expose  the  semi-insulating  GaAs  substrate.  Pro¬ 
cessing  of  the  LEDs  is  then  completed  at  Caltech. 
After  planarizing  the  neural  chip  by  stripping  the 
poly-crystalline  material  deposited  everywhere  but 
in  the  dielectric  windows,  LEDs  are  fabricated  and 
metallically  interconnected  with  the  electronic 
neurons.  The  culmination  of  this  research  will  be 
the  demonstration  of  high-density,  monolithic  opto¬ 
electronic  neural  systems. 


1.12  Applications  of  Resonant 
Tunneling  Diodes  in  GaAs  MESFET 
VLSI 

Sponsors 

Texas  Instruments,  Inc. 

U.S.  Army  Research  Office 
Grant  DAAL03-92-G-0251 

Project  Staff 

Rajni  J.  Aggarwal  and  Professor  Clifton  G.  Fonstad 
Jr.  in  collaboration  with  P.  Nuytkens 

We  are  investigating  the  integration  of  resonant  tun¬ 
neling  diodes  (RTDs)  with  GaAs  MESFETs.  Our 
intent  is  to  use  epitaxial  regrowth  techniques  to 
integrate  RTDs  with  commercially  available  GaAs 
MESFET  technology.  We  have  considered  both  the 
development  of  a  logic  family  as  well  as  a  SRAM. 
RTDs  can  provide  stable,  low  current,  voltage 
states,  and  both  applications  are  designed  to  take 
full  advantage  of  these  properties.  Additionally,  the 
small  area  of  tunnel  diodes  gives  the  potential  for 
increasing  integration  densities.  Using  experimental 
techniques  as  well  as  HSPICE  simulations  to  eval¬ 
uate  the  circuits,  we  have  concluded  that  only  the 
SRAM  presents  a  commercially  viable  technology. 

We  compared  the  performance  of  the  RTD  logic 
family  to  that  of  direct  coupled  FET  logic  (DCFL). 
We  considered  23  stage  ring  oscillators  using  1,  2, 
3,  and  4  input  NOR  gates.  The  RTD  logic  consist¬ 
ently  dissipated  an  order  of  magnitude  less  power 
than  DCFL,  however,  much  of  this  advantage  was 
lost  when  the  delay  per  stage  was  considered. 
Overall,  the  power-delay  product  of  the  RTD  logic 
was  on  average  only  a  factor  of  four  times  smaller 
than  that  of  DCFL.  This  does  not  represent  enough 
of  an  improvement  over  DCFL  to  be  a  commercially 
viable  logic  family. 


The  SRAM  cell  consists  of  an  enhancement  mode 
FET  connected  to  two  RTDs.  An  integrated  RTDs 
and  FET  static  memory  cell  has  two  main  advant¬ 
ages  over  present  GaAs  SRAMs.  Using  epitaxial 
regrowth,  RTDs  can  be  grown  directly  on  top  of  the 
driver  FET,  reducing  the  memory  cell  size  to  that  of 
a  single  transistor.  This  design  has  the  possibility 
of  obtaining  DRAM  densities  in  a  static  memory. 
Additionally,  proper  diode  design  will  result  in  both 
the  ’’1"  and  "0"  states  being  low  power  states. 

We  have  simulated  the  basic  memory  cell  and  are 
developing  its  driver  circuitry.  We  have  exper¬ 
imentally  demonstrated  the  bistable  nature  of  diode 
chains.  Using  both  the  experimental  and  simulation 
results,  we  have  determined  a  series  of  design  con¬ 
straints  for  both  RTDs  and  FETS.  In  the  coming 
year  we  will  be  designing  and  growing  suitable 
RTDs  for  these  cells  both  in  stand  alone  form  as 
well  as  integrated  with  circuits.  An  integral  part  of 
the  integration  of  these  devices  is  the  process  flow 
required  for  manufacturing  them,  and  these  issues 
will  also  receive  attention  in  the  coming  year. 


1.13  Investigation  of  Infrared 
intersubband  Emission  from 
InGaAs/AIAs/lnP  Quantum  Well 
Heterostructures 

Sponsors 

National  Science  Foundation 
U.S.  Army  Research  Office 
Grant  DAAL03-92-G-0251 

Project  Staff 

Jurgen  H.  Smet,  Professor  Clifton  G.  Fonstad,  Jr. 
in  collaboration  with  Professor  Qing  Hu 

In  this  work,  we  are  studying  the  possibility  of 
achieving  population  inversion  between  two 
quantum  well  subbands,  with  an  electrically  pumped 
intersubband  laser  as  the  ultimate  goal.  A  triple 
quantum  well  structure  consisting  of  two  narrow 
filter  wells  and  a  wide  cente-  appears  to  be  a 
promising  structure  to  achiev-  u  s  goal.  To  avoid 
fast  relaxation  through  elec:  .  i  0-phonon  scat¬ 
tering,  it  is  desirable  to  de-:  a  -  structure  for  a 
lasing  frequency  smaller  than  the  LO-phonon  fre¬ 
quency.  In  addition,  the  required  population  inver¬ 
sion  imposed  by  the  gain  needed  to  overcome 
optical  losses  should  be  sufficiently  low  to  prevent 
the  energy  difference  between  the  quasi-Fermi  level 
in  the  upper  subband  and  the  ground  state  from 
being  larger  than  the  LO-phonon  energy.  These 
stringent  requirements  can  only  be  met  in  wide 
quanturri  well  structures. 
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To  test  the  feasibility  of  population  inversion,  the  l-V 
characteristics  at  low  temperature  of  triple  quantum 
well  structures  with  varying  barrier  thickness  and 
well  thickness  are  being  studied  to  verify  that 
injection  and  removal  of  the  electrons  from  the 
inner  well  is  not  the  current  limiting  factor,  but  that 
the  intersubband  relaxation  processes  determine 
the  current  and  therefore  that  population  inversion 
might  be  achieved. 

As  the  width  and  the  number  of  the  quantum  wells 
increases,  and  thus  the  characteristic  energy  scales 
reduce  and  the  number  of  possible  transmission 
channels  increase,  the  assignment  of  various  trans¬ 
mission  channels  to  the  currents  peaks  in  the  l-V 
characteristics  becomes  considerab'y  more  difficult 
and  ambiguous.  Of  the  techniques  used  to  analyse 
the  l-V  characteristics,  magneto-quantum  oscil¬ 
lations  observed  in  the  tunnel  current  as  a  function 
of  applied  bias  when  sweeping  the  magnetic  field 
have  proven  to  be  the  most  powerful.  Using  Fast 
Fourier  Transform  algorithms,  we  are  able  to  deter¬ 
mine  the  transmission  channel  responsible  for  a 
certain  current  peak,  the  energy  seperation 
between  the  lowest  subbands,  and  the  charge 
present  in  the  subbands  of  the  first  well.  We  can 
also  determine  that  the  GaAs-like  LO-phonon  emis¬ 
sion  process  is  the  dominant  inelastic  process  over 
InAs-like  inelastic  scattering.  This  is  in  agreement 
with  reports  on  bulk  lno53Gao.47As. 

Our  interest  in  applying  a  magnetic  field  parallel  to 
the  current  goes  beyond  its  use  as  a  diagnostic 
tool.  The  magnetic  field  reduces  the  dimensionality 
of  the  originally  2D  quantum  well  to  a  00  system 
and  should  suppress  inelastic  scattering  processes 
if  the  spacing  between  Landau  levels  in  different 
subbands  differs  from  the  LO-phonon  energy  and 
may  thus  aid  in  achieving  population  inversion.  We 
are  currently  performing  experiments  to  verify  and 
demonstrate  this  suppresion  of  inelastic  scattering. 


1.14  Infrared  Characterization  of 
InGaAs/AIAs/lnP  Quantum-Well 
Heterostructures 

Sponsor 
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The  application  of  lll-V  single  quantum  well  (SOW) 
C1-to-C2  intersubband  transitions  for  mid 
(2-5  pm)  and  far  (8-10  pm)  infrared  (IR) 
detection,  owing  to  their  fast  response  speed,  high 
detectivity,  and  reliance  on  more  mature  growth  and 
process  technologies,  have  advantages  over  con¬ 
ventional  interband  transition  materials  such  as 
mercury  cadmium  telluride  (MCT),  which  suffers 
serious  problems  of  device  uniformity  and  yield 
rate. 

By  devising  angle-  and  polarization-resolved  IR 
spectroscopy  techniques,  we  have  shown  that  con¬ 
ventional  wisdom  forbidding  normal  incidence  (TE) 
quantum  well  intersubband  transitions  is  incorrect. 
In  fact,  we  have  consistently  measured  equally 
strong  TE-  and  TM-active  intersubband  QW  absorp¬ 
tion  peaks  and  have  seen  a  strain-dependent  split¬ 
ting  between  these  peaks.  These  results  enable  us 
to  develop  a  new  theory  that  considers  the  sym¬ 
metry  effects  on  subband  Bloch  states,  which  leads 
to  the  explanation  of  polarization  selection  rules, 
absorption  strength,  and  splitting  in  our  exper¬ 
imental  work  (see  next  section). 

By  engineering  the  composition  and  width  of 
InGaAs  quantum  well  and  the  InAIAs  barrier  height, 
we  have  successfully  measuring  the  intersubband 
absorption  in  the  2  to  5  pm  wavelength  region 
based  on  InGaAs/AIAs/lnAIAs  SOW  the  resonant 
tunneling  diode  structures.  Future  work  involves 
using  MOW  structures  to  increase  photocurrent  and 
applying  selective  etches  to  contact  the  QW 
directly  to  realize  high  performance  IR  detectors. 

1.15  Analysis  of  the  Symmetry 
Properties  of  Quantum  Well  Subband 
Energy  Levels 

Sponsors 

National  Science  Foundation 
U.S.  Army  Research  Office 
Grant  DAAL03-92-G-0251 
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in  collaboration  with  R.  Victor  Jones  and  Victor 
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k  •  p  envelope  function  formalism,  due  to  its  com¬ 
putational  simplicity,  has  been  widely  used  for  cal¬ 
culating  the  quantum  well  (QW)  subband  energies 
and  predicting  the  polarization  selection  rules  of 
interband  and  intersubband  transitions.  It  has  been 
assumed  that  the  symmetry  properties  of  high-lying 
conduction  subband  Bloch  states  are  the  same  as 
in  the  ground  state;  however,  closer  examination  of 
the  first-order  k  •  p  correction  to  the  high-lying 
subband  Bloch  states  reveals  this  is  an  invalid 
assumption  because  the  narrow  width,  highly  con¬ 
finement  QW  problem  fails  all  the  criteria  that  ori¬ 
ginally  made  in  deriving  the  envelope  function 
formalism  which  assumed  a  weak,  slowly  varying 
confinement  potential  and  perturbed  state  energies 
close  to  the  band  edge. 

We  consider  the  irreducible  representations  of 
subband  Bloch  states  in  a  [001]  D2d  symmetric  QW 
structure  for  C,-to-C2  intersubband  transitions. 
Neglecting  the  QW  in-plane  crystal  anisotropy  along 
[1,1,0]  and  [1,-1,0]  directions  and  the  intersubband 
spin-orbit  coupling,  we  have  shown  that  the  C,(n=1) 
state  transforms  as  r6{ri):St,  and  the  C2(n=2) 
degenerate  states  transform  acco^g  to 
r7(r5):  (X-iY) ,  yJ2re(rsy.  (X+iY)i/72,  and 
r7(r4):  Z .,  and  that  they  are  of  Kramers  degen¬ 
eracy  in  the  absence  of  external  magnetic  field. 

Qur  model  leads  us  to  the  conclusions  that  Ct-to-C2 
intersubband  transitions  are  of  equal  absorption 
strength  for  polarization  parallel  and  perpendicular 
to  the  QW  plane;  and  that  the  r4{X,Y)  and  TsiZ) 
degenerate  states  split  in  the  presence  of  strains, 
which  explains  the  details  of  our  recent  exper¬ 
imental  work. 


1.16  Microwave  Characterization, 
Anaiysis,  and  Modeiing  of  Emitter-Down 
Heterojunction  Bipoiar  Transistors 

Sponsors 

National  Science  Foundation/Northeastern 
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Sheila  Prasad,  Professor  Clifton  G.  Fonstad,  Jr.  in 
collaboration  with  B.  Meskoob'®  and  M.  Kim" 

Emitter-down  InGaAs/lnAIAs/lnP  heterojunction 
bipolar  transistors  have  been  characterized  at 
microwave  frequencies  and  small-signal  equivalent 
circuit  models  have  been  obtained  using  the  com¬ 
mercial  Touchstone  software.  The  simulated 
annealing  (SA)  algorithm,  which  does  not  depend 
on  good  initial  conditions,  has  been  applied  to  the 
modelling  problem  and  the  results  obtained  with  SA 
and  Touchstone  were  consistent.  However,  contin¬ 
uous  human  intervention  was  required  to  obtain  the 
results  with  Touchstone,  whereas  SA  yielded  the 
results  automatically.  SA  was  therefore  proved  to 
be  the  superior  method  for  optimization  because  of 
its  speed  and  convenience. 

Small-signal  s-parameter  measurements  were 
made  by  on-wafer  probing  of  the  transistors  using  a 
Hewlett-Packard  851  OB  automatic  network  analyzer 
and  a  Cascade  Microtech  probe  station.  These 
measurements  were  made  for  up  to  39  different 
bias  points.  Consistent  small-signal  equivalent 
circuit  models  were  obtained  using  both  of  the  mod¬ 
eling  methods  described  above.  The  bias  depend¬ 
ence  of  the  elements  of  the  small-signal  equivalent 
circuit  was  also  determined,  and  it  was  found  that 
five  of  the  elements  of  the  circuit  were  highly  bias 
dependent. 

These  results  were  used  for  the  large-signal  model¬ 
ling  of  the  HBT.  The  model  obtained  was  based  on 
first-order  device  physics,  and  this  was  verified 
experimentally.  When  standard  large  signal  mod¬ 
eling  techniques  such  as  the  Harmonic  Balance 
method  are  used,  it  is  observed  that  the  first-order 
model  was  sufficeint  accurate  for  medium  power 
levels  with  respect  to  the  particular  device  under 
consideration.  At  higher  power  levels  where  the 
transistor  is  driven  into  hard  saturation  and  cutoff,  a 
more  elaborate  model  is  necessary  to  obtain  a  high 
third-order  intermodulation  product  (IP3)  and  hence 
device  linearity.  This  investigation  is  now  in  pro¬ 
gress. 
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1.17  Damage*Free  In-Situ  UHV  Etching 
and  Cleaning  of  ill*V  Heterostructures 
Using  Molecular  Beams 

Sponsors 

AT&T  Bell  Laboratories  Fellowship 
DARPA/NCIPT 
National  Science  Foundation 
Grant  ECS  90-07745 
TRW  Corporation 

Project  Staff 

Isako  Hoshino,  Professor  Clifton  G.  Fonstad,  Jr. 

Development  of  damage-free  ultra-high  vacuum 
(UHV)  etching,  cleaning,  and  regrowth  techniques 
compatible  with  molecular  beam  epitaxy  (MBE)  and 
ex-situ  processing  of  lll-V  heterostructures  is  a 
major  challenge  facing  device  researchers.  The 
ability  to  selectively  pattern,  etch,  and  overgrow 
quantum  heterostructures  is  crucial  to  the  effective 
realization  of  integrated  optical  circuitry  and 
quantum  effect  electronic  structures.  Present  tech¬ 
niques  to  do  this  involve  relatively  high  energy  (100 
eV  and  above)  ion  beans  which  cause  substantial 
sub-surface  damage,  much  of  which  is  impossible 
to  remove. 

As  a  solution  to  the  problem  of  process-induced 
damage,  we  have  begun  a  program  investigating 
the  use  of  UHV  kinetic  molecular  beam  techniques 
(widely  used  to  study  atomic  interactions)  to  etch 
and  clean  lll-V  substrates  and  heterostructures  with 
a  minimum  of  surface  damage  and  maximum  flexi¬ 
bility.  Depending  on  the  etchant  gas  mixture  estab¬ 
lished,  it  is  anticipated  that  low  energy  (0.5  to  3.0 
eV)  kinetic  beams  can  be  used  to  (1)  both  direc¬ 
tionally  and  isotropically  etch-pattern  lll-V  hetero¬ 
structure  wafers  with  no  damage:  (2)  clean  surfaces 
allowing  epitaxial  growth  on  wafers  that  have  been 
removed  from  the  UHV  environment  for  external 
processing;  and  (3)  selectively  remove  masking 
materials  and  clean  surfaces  suitable  for  subse¬ 
quent  overgrowth. 

Currently,  the  UHV  Kinetic  Beam  Etching  (KBE) 
chamber  construction  designed  to  use  a  methane- 
hydrogen  gas  mixture  in  a  supersonic  beam  source 
is  near  completion.  The  present  design  also  allows 
the  chamber  to  be  connected  to  the  existing  Riber 
2300  solid  source  MBE  system  through  a  transfer 
mechanism  of  special  design. 

The  initial  function  tests  of  the  KBE  system  are 
scheduled  to  be  performed  in  the  near  future.  Sub¬ 
sequently,  a  full-scale  characterization  of  the  ability 
of  the  KBE  will  be  performed,  mainly  concentrating 


on  etch  rate,  etch  profile,  and  surface  damage 
assessment.  The  extent  of  any  surface  damage 
will  be  determined  through  photoluminescence  and 
carrier  mobility  measurements  using  various  lll-V 
heterostructures  grown  by  MBE. 
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2.1  Introduction 
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Texas  Instruments 

The  goal  of  this  project  is  to  investigate 
lnAIAs/n*-lnGaAs  Metal-lnsulator-Doped  channel 
Field-Effect  Transistors  (MIDFETs)  on  InP.  These 
devices  are  of  great  interest  for  applications  in  long- 
wavelength  lightwave  communication  systems  and 
ultra-high  frequency  high-power  microwave  tele¬ 
communications. 

InAIAs/InGaAs  Modulation-Doped  Field-Effect 
Transistors  (MODFETs)  on  InP  represent  a  prom¬ 


ising  choice  for  a  variety  of  microwave  and  pho¬ 
tonics  applications.  The  outstanding  transport 
properties  of  InGaAs  have  yielded  devices  with  very 
low-noise  and  high-frequency  characteristics.* 
Unfortunately,  the  low  breakdown  voltage  of 
InAIAs/InGaAs  MODFETs  on  InP  (typically  less 
than  5  V)  severely  restricts  their  use  in  medium- 
and  high-power  applications.*  Additionally,  in  InP 
photonics  receivers  based  on  Metal-Semiconductor- 
Metal  (MSM)  photodiodes,  one  must  use  a  separate 
high  voltage  supply  to  operate  the  photodetectors 
because  they  typically  require  several  volts  across 
them  to  achieve  a  high-quantum  efficiency.* 

A  device  strategy  with  great  potential  for  power 
handling  is  the  InAIAs/n^-InGaAs  MIDFET  featuring 
an  undoped  insulator  and  a  thin,  heavily-doped 
channel.*  Our  research  over  the  last  few  years  at 
MIT  on  the  physics  and  technology  of  this  device 
has  revealed  that  its  breakdown  voltage,  Vb,  is 
large  and  can  be  engineered  using  pseudomorphic 


'  Fujitsu  Laboratories.  Atsugi,  Japan. 

2  L.D.  Nguyen,  L.E.  Larson,  and  U.K.  Mishra,  "Ultra-High-Spood  Modulation-Doped  Field-Effect  Transistors:  A  Tutorial  Review,’  Proc. 
IEEE  90(A):  494-518  (1992). 

3  J.H,  Burroughes  and  M.  Hargis,  "1.3  um  InGaAs  MSM  Photodetector  with  Abrupt  lnGaAs.'lnAIAs  Interface,"  IEEE  Phot.  Tech.  Lett. 
3(6):  532  (1991). 

*  J.A.  del  Alamo  and  T.  Mizutani,  "An  ln,uAloMAs/n'-ln,uGac.7As  MISFET  with  a  Heavily-Doped  Channel,"  IEEE  Electron  Device  Lett. 
EDL-8(11);  534-536  (1987). 
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insulators®  and  quantized  channels.®  This  is  in  addi¬ 
tion  to  having  a  comparable  frequency  response  to 
MODFETs  of  similar  gate  lengths.^ 

During  1992,  we  investigated  the  physics  of  break¬ 
down  in  lnAIAs/n*-lnGaAs  MIDFETs.  We  devel¬ 
oped  a  hypothesis  for  the  breakdown  mechanism  in 
this  device  that  explains  our  experimental  observa¬ 
tions  to  date.  We  verified  our  hypothesis  by  carrying 
out  detailed  breakdown  measurements  as  a  func¬ 
tion  of  temperature.  Our  work  required  the  develop¬ 
ment  of  a  breakdown  measuring  technique  that  was 
unambiguous,  safe,  and  reliable.  The  following 
sections  describe  in  greater  detail  our  technical 
findings  and  conclusions. 


2.2  Drain-current  Injection  Technique 
for  the  Measurement  of  Breakdown 
Voltage 

Precise  knowledge  of  the  breakdown  voltage,  Va.  of 
a  device,  including  an  unambiguous  definition  of 
breakdown  and  a  reliable  and  safe  measuring  tech¬ 
nique,  is  essential  for  application  of  a  device  in  a 
circuit  environment.  However,  the  FET  literature 
contains  many  inconsistent  definitions  of  break¬ 
down.  In  addition,  most  measuring  techniques  are 
ambiguous,  not  amenable  to  automation,  and  can 
easily  result  in  device  destruction.  In  this  section,  a 
new  technique  of  measuring  the  breakdown  voltage 
of  FETs  is  presented.  Its  application  to 
lnAIAs/n*-lnGaAs  MIDFETs  is  demonstrated. 

This  new  technique,  drain-current  injection  tech¬ 
nique,  uses  a  semiconductor  parameter  analyzer 
HP4145B  with  the  device  biased  in  a  configuration 
schematically  illustrated  in  figure  1.  To  measure 
breakdown,  a  fixed  predefined  current  is  injected 
into  the  drain,  the  gate-source  voltage  is  ramped 
down  from  a  strong  forward  bias  to  below  threshold, 
and  Vds  and  Iq  are  monitored.  In  this  manner,  the 
device  goes  from  the  linear  regime  through  the  sat¬ 
uration  region  and  into  breakdown.  This  is  analo¬ 
gous  to  tracing  the  locus  of  Vos  versus  Vqs  at  a 
fixed  Id  on  the  output  l-V  characteristics.  As  illus¬ 
trated  below,  the  drain-source  breakdown  voltage 
Vb(d-s)  is  the  peak  drain-source  voltage  obtained  in 


Figure  1.  Schematic  circuit  diagram  for  implementation 
of  the  drain-current  injection  technique  for  measuring 
breakdown  voltage  in  FETs. 

this  scan,  and  the  drain-gate  breakdown  voltage 
Vb<d-g)  is  the  drain-gate  voltage  at  Iq  -  -  Id- 

We  illustrate  this  technique  on  one  of  our  devices  in 
figure  2,  which  is  a  plot  of  Vos,  Vdg.  and  Iq  versus 
Vqs  for  Id  - 1  mA/mm  at  room  temperature.  For 
values  of  Vgs  above  V,  -  -  0.6V,  both  Vos  and  Vdg 
are  relatively  small  and  Iq  -  0.  At  Vqs  around  V,, 
both  Vds  and  Vdg  rise  sharply  and  Ig  starts 
becoming  significant.  At  about  Vqs  -  -  1  V  ,  Vds 
peaks.  This  is  the  drain-to-source  breakdown 
voltage  of  the  device.  At  a  more  negative  Vqs.  Ig 
becomes  1  mA/mm,  and  the  source  current 
becomes  zero.  This  defines  the  drain-to-gate  break¬ 
down  voltage  Vb(d-g).  Because  of  the  well-defined 
nature  of  these  two  critical  voltage  points,  they  can 
be  extracted  using  a  computer,  making  this  tech¬ 
nique  easily  automatable. 

Our  drain-current  injection  technique  also  provides 
insight  into  the  physics  of  breakdown.  This  is  dis¬ 
cussed  in  more  detail  in  the  next  section  of  this 
chapter.  Figure  2  shows  that  when  this  particular 
device  reaches  breakdown,  the  source  current  goes 
to  zero  and  Vdg  saturates.  This  shows  that  the 
device  breakdown  is  being  governed  by  a  drain-to- 


5  S.R.  Bahl,  W.J.  Azzam,  and  J.A.  del  Alamo,  "Strained-Insulator  InAI,  .As/n'-lnouGacrAs  Heterostructure  Field-Effect  Transistors," 
IEEE  Trans.  Electron  Dev.  38  (9):  1986-1992  (1991). 

6  S.R.  Bahl  and  J.A.  del  Alamo,  "Breakdown  Voltage  Enhancement  from  Channel  Quantization  in  InAlAs/n  -InGaAs  Heterostructure 
Field-Effect  Transistors,"  IEEE  Electron  Dev.  Lett.  13(2);  123-125  (1992). 

7  S.R.  Bahl,  B.R.  Bennett,  and  J.A.  del  Alamo,  "Doubly-Strained  ln„.Alo»As/n*-lrv,„Gaoi5As  HFET  with  High  Breakdown  Voltage,"  IEEE 
Electron  Dev.  Lett.  14(1):  22-24  (1993). 
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lo  ■  1  mA/mm 
T  .  300K 


Figure  2.  Illustration  of  drain-current  injection  technique 
in  InAIAs/n'-InGaAs  HFET:  Vcxs,  Vos.  and  Ig  versus  Vgs- 
The  peak  of  Vos  is  Vb(o-s).  and  the  point  at  which 
Ig-  1  mA/mm  is  Vb<o-g). 

gate  breakdown  mechanism  ratner  than  by  a 
channel  or  source-to-drain  breakdown  phenom¬ 
enon.  This  is  not  always  the  case,  and,  in  fact,  at 
very  low  temperatures,  most  of  our  devices  are 
governed  by  a  channel  breakdown  mechanism. 

To  conclude  this  section,  we  have  presented  a  new 
method  of  measuring  the  breakdown  voltage  in 
FETs.  This  simple  technique  provides  unambig¬ 
uous  determination  of  Vb(o-s)  and  Ve(o-G)-  The 
current  through  the  device  is  limited  to  a  prespeci¬ 
fied  value  so  that  risk  to  fragile  devices  is  mini¬ 
mized. 


2.3  Physics  of  Breakdown  in 
InAIAs/n^-InGaAs  HFETs 

We  have  performed  a  detailed  study  of  the  physics 
of  breakdown  of  lnAIAs/n*-lnGaAs  MlDFETs.  In 
brief,  our  research  reveals  that,  similar  to 
heterojunction  avalanche  photodiodes,  breakdown 
in  these  devices  is  a  two-step  process  (see  figure 
3).  First,  electrons  are  injected  from  the  gate  into 
the  channel  through  thermionic  emission  or 
thermionic-field  emission.  Second,  the  electrons 
entering  the  channel  are  very  hot  and  release  their 
energy  in  an  impact-ionization  process  which  starts 
avalanche  breakdown  in  the  channel.  This  hypoth¬ 
esis  explains  our  experimental  obsenrations  to  date 
and  also  the  low  breakdown  voltage  of 
InAIAs/InGaAs  MODFETs. 


Figure  3.  Schematic  illustration  of  the  postulated  break¬ 
down  process  in  lnAIAs/n*-lnGaAs  MlDFETs.  With  a 
strong  reverse  bias  between  the  gate  and  the  drain,  hot- 
electrons  are  injected  from  the  gate  to  the  channel 
through  a  thermionic  or  thermionic-field  emission  process 
(step  1),  and,  as  they  relax  in  the  channel,  they  cause 
impact-ionization  in  the  channel  (step  2). 

The  devices  used  in  this  study  are  based  on  a 
MBE-grown  heterostructure  on  S.I.-InP.  As  shown 
in  figure  4,  their  structure  consists  of  (bottom  to 
top).  a  1000  A  InoszAlowAs  buffer,  a 
75  A  lno53Gao47As  subchannel,  a  100  A  n* 
(4x  10’®  cm-3)  -  lno  53Gao  47As  channel,  a  300  A 
undoped  lno4iAlo59As  insulator,  and  a  50  A 
undoped  I  no  536^47  As  cap.®  HFETs  with 
Lg  - 1 .9  pm  and  Wg  -  30  pm  were  studied.  Both 
drain-source  breakdown  voltage,  Vb(d-s).  and  drain- 
gate  breakdown  voltage,  Vb(d-g).  were  measured 
using  the  drain-current  injection  technique  pre¬ 
sented  in  the  previous  section  as  a  function  of  tem¬ 
perature. 

At  approximately  room  temperature,  the  breakdown 
voltages  show  a  negative  temperature  coefficient 
with  Vb(d-g)  increasing  from  12.8  V  at  360  K  to  24.7 
V  at  260  K  (see  figure  5).  Vb(d-s)  tracks  Vb(d-g). 
and  increases  from  11.1  V  to  23.5  V.  Measure¬ 
ments  of  Ig  showed  that  for  the  temperature  range 
260  K-360  K  at  drain-source  breakdown,  all  the 


8  S.R.  Bahl,  8.R.  Bennett,  and  J.A.  del  Alamo,  "A  High-Voltage,  Doubly-Strained  lno„Alo»As/n--lnoMGao3»As  HFET,"  paper  presented  at 
the  Fourth  International  Conference  on  InP  and  Related  Materials,  Newport,  Rhode  Island,  April  20-24, 1992,  pp.  222-225. 
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Figure  4.  Schematic  of  device  heterostructure  used  in 
our  breakdown  studies. 


injected  current  flows  from  drain  to  gate  (see  figure 
5).  At  these  temperatures,  drain-gate  breakdown 
limits  the  value  of  the  drain-source  breakdown 
voltage.  The  temperature  dependence  that  is  meas¬ 
ured  is  also  incompatible  with  a  simple  impact 
ionization  process  in  the  channel. 

To  understand  the  relevant  physics,  we  examined 
in  detail  the  temperature  dependence  of  Iq  at  the 
onset  of  breakdown.  An  Arrhenius  plot  of  ln(lG/T*) 
was  generated  for  Vdg  steps  from  below  Vt  (-0.8  V) 
through  Vb(o-g)  at  300  K  (16  V),  as  shown  in  figure 
6.  Data  from  the  locus  of  the  drain-current  injection 
technique  (Id-1  mA/mm)  was  used.  The  gate 
current  was  found  to  be  thermally  activated  with  an 
activation  energy  of  0.41-0.46  eV,  regardless  of 
Vdg-  This  value  is  consistent  with  electron 
thermionic-field  emission  across  the  barrier  as  the 
breakdown  limiting  mechanism.  The  relatively  con¬ 
stant  activation  energy  exhibited  from  below 
threshold  until  breakdown  implies  that  the  voltage 
supported  by  the  insulator  does  not  change  once 
the  channel  has  been  depleted.  Instead,  the  excess 
voltage  is  supported  by  the  lateral  fields  in  the 
drain-gate  gap.  Measurements  using  a  gated 
Hall-bar  structure  (essentially  a  long  gate-length 
FET  with  taps  directly  into  the  channel)  confirmed 
this  hypothesis. 


Figure  5.  Orain-to-source  breakdown  voltage,  drain-to- 
gate  breakdown  voltage,  and  gate  current  at  drain-to- 
source  breakdown  voltage  versus  temperature. 


340  320  300  iN  360  240 


Figure  6.  Arrhenius  plot  of  gate  current  as  a  function  of 
VoQ.  The  corresponding  activation  energy  is  indicated  in 
the  diagram. 

The  simultaneous  occurrence  of  impact-ionization 
was  established  by  detecting  holes  directly  with  a 
negatively  biased  sidegate  in  a  specially  designed 
structure,  as  shown  in  figure  7.  The  drain-current 
injection  technique  at  Id  -  1  mA/mm  was  used,  and 
the  sidegate  and  gate  currents  monitored.  The  rise 
and  saturation  of  the  sidegate  current  coincides 
with  the  rise  and  saturation  of  the  drain-gate 
voltage  and  drain-gate  current.  Since  a  very  nega¬ 
tive  voltage  has  been  applied  to  the  sidegate,  only 
holes  can  contribute  to  the  sidegate  current.  This 
obsen/ation  therefore  verifies  the  occurrence  of  hole 
generation  in  the  channel  in  an  impact  ionization 
process. 
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Figure  7.  Sidegate  current  Uq,  gate  current,  and  Vdg  as 
a  function  of  Vqs  in  a  sidegate  structure  for  lo  -  1  mA/mm. 
The  correlation  between  Ug  and  Ig  for  Vsg--20  V 
unmistakably  indicates  the  existence  of  holes  in  the 
channel. 

Our  hypothesis,  pictorially  sumarized  in  figure  3, 
explains  our  previously  observed  results  which 
show  that  both  the  insulator  and  channel  para¬ 
meters  affect  Vb.  Vb  increases  if  (1)  the  Schottky 
barrier  height  of  the  insulator  is  enhanced®  (lower 
thermionic  emission),  (2)  the  channel  bandgap  is 
enlarged'®  (lower  multiplication),  and  (3)  the 
channel  doping  is  decreased"  (decreased 
thermionic  emission  resulting  from  a  higher  voltage 
across  the  insulator  at  threshold).  Now  our  hypoth¬ 
esis  can  be  used  as  a  guiding  principle  for  further 
improvements  of  the  breakdown  voltage  of 
InAIAs/m-InGaAs  MIDFETs. 

A  combination  of  thermionic  (field)  emission  and 
impact-ionization  is  also  likely  to  limit  the  break¬ 
down  voltage  in  InAIAs/InGaAs  MODFETs,  since  in 
these  devices,  doping  the  insulator  results  in  a 
reduced  barrier  to  electron  flow.  This  might  pose  a 
fundamental  limit  to  the  engineering  of 
InAIAs/InGaAs  MODFET  breakdown  voltages. 
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3.1  Introduction 

Current  state-of-the-art  epitaxial  growth  techniques 
employ  various  metalorganic  and  hydride  gases, 
particularly  for  high  vapor  pressure  species  such  as 
phosphorus  and  sulfur,  to  deliver  constituent 
species  to  the  substrate  surface.  Chemical  beam 
epitaxy  (CBE)  utiiizes  both  metaiorganic  (MO) 
gases  and  hydride  gas  sources;  metaiorganic 
moiecuiar  beam  epitaxy  (MOMBE)  uses  MO  gases; 
and  gas  source  moiecuiar  beam  epitaxy  (GSMBE) 
combines  hydride  gas  sources  with  the  more  typical 
solid  elementai  sources.  The  more  conventional 
growth  approach,  molecular  beam  epitaxy  (MBE), 
uses  oniy  moiecuiar  beams  derived  from  the 
thermal  evaporation  of  elemental  or  compound  solid 
sources. 

All  the  research  objectives  described  in  this  chapter 
are  concerned  with  layered  structures  composed  of 
compounds  containing  As  and  P,  or  Se,  S,  and  Te. 
The  presence  of  these  high  vapor  pressure  species 
suggests  that  many  advantages  will  be  gained 
through  fabrication  of  the  device  structures  by  the 
gaseous  source  epitaxy  approach. 

In  the  chemical  beam  epitaxy  facility  at  MIT,  epi¬ 
taxial  growth  of  both  ll-VI  and  lll-V  compound  semi¬ 
conductors  is  underway  using  all  of  the 
aforementioned  growth  techniques.  The  chemical 
beam  epitaxy  laboratory  consists  of  two  intercon¬ 
nected,  fully  operational  gaseous  source  epitaxy 
systems  along  with  several  smaller  chambers  used 
for  sample  introduction  and  in-situ 
analysis/metalization.  The  multichamber  epitaxy 
system  allows  various  heterostructures  to  be  grown 
in  a  continuous  ultrahigh  vacuum  environment.  The 
interconnection  feature  enables  an  additional 


degree  of  freedom  in  device  design  by  providing  the 
ability  to  integrate  the  ll-VI  and  lll-V  material  fami¬ 
lies  in  a  single  device.  The  lll-V  GSMBE  uses  solid 
elemental  sources  of  Ga,  In,  Al,  Si  and  Be  and 
gaseous  hydride  sources  of  arsenic  and 
phosphorus.  The  ll-VI  reactor,  on  the  other  hand, 
is  highly  flexible,  offering  MO  gas  sources,  hydride 
gas  sources,  and  solid  effusion  cell  type  sources. 
Various  constituent  species  available  in  the  ll-VI 
reactor  include  Zn,  S,  Se,  Te,  Cl,  N,  In,  Ga,  and  As. 
Figure  1  highlights  the  many  material  systems, 
based  on  ll-VI  heterostructures,  lll-V  heterostruc¬ 
tures,  or  on  a  combination  of  ll-VI  and  lll-V  semi¬ 
conductors,  which  are  available  for  exploration. 
Many  of  these  are  currently  being  fabricated  in  the 
epitaxy  facility. 
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Figure  1.  The  shaded  area  highlights  the  many  different 
ll-VI  and  lll-V  semiconductors  and  the  various  hetero- 
structure  configurations  which  are  available  for  investi¬ 
gation  by  fabrication  in  the  MIT  chemical  beam  epitaxy 
laboratory. 
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Wide  bandgap  ll-VI  materials  have  important  tech¬ 
nological  potential,  for  example,  as  short  wave¬ 
length  (visible  to  ultraviolet)  optical  sources  with 
applications  in  optical  recording  and  bright  emissive 
displays.  A  recent  key  advance  in  the  area  of  2n 
chalcogenides  involves  successful  p-type  doping  of 
ZnSe-based  semiconductors  using  nitrogen  as 
acceptor  species  (first  reported  by  Park  et  al.'). 
This  led  to  the  achievement  of  a  pn  diode  injection 
laser  operating  at  blue/blue-green  wavelengths  by 
Haase  et  al.^  and  Jeon  et  al.^  These  structures 
consisted  of  (Zn,Cd)Se  narrow  bandgap  well  layers 
with  Zn(Se,S)  cladding  barrier  layers,  although 
more  recent  structures  are  composed  of  barrier 
layers  of  (Zn,Mg)(Se,S).''  To  minimize  defect  gen¬ 
eration  within  the  active  regions  of  the  device,  as 
well  as  to  maximize  the  incorporation  of  the 
nitrogen  acceptor  species,  the  substrate  temper¬ 
atures  have  been  reported  to  be  extremely  low 
(-150-250°C).  By  employing  gas  source  epitaxy 
technologies  for  the  growth  of  the  aforementioned 
quantum  well  structures,  control  of  the  constituent 
species  via  precision  mass  flow  controllers  is  antic¬ 
ipated  to  offer  a  solution  to  the  difficulties  encount¬ 
ered  in  the  MBE  of  materials  containing  one  or 
more  high  vapor  pressure  species.  In  addition,  as 
the  growth  temperature  is  lowered,  control  of  the 
flux  ratio  will  become  even  more  crucial. 

The  use  of  a  beam-assisted  growth  approach  in 
conjunction  with  metalorganic  molecular  beam 
epitaxy  provides  us  with  the  capability  of  lateral  pat¬ 
terning  as  well  as  composition  modulation  in  the 
growth  direction,  which  cannot  be  achieved  by  con¬ 
ventional  MBE.  To  address  some  of  these 
materials-related  issues,  we  have  embarked  on  a 
ll-VI-based  research  program  which  emphasizes 
growth  by  the  gas  source  epitaxy  approach  of 
various  heterostructures  which  can  be  used  for 
visible  light  emitters. 

In  section  4.2  we  describe  the  progress  that  we 
have  made  in  understanding  the  photo-assisted 
MOMBE  growth  of  ZnSe  using  a  combination  of 
solid  and  gaseous  MO  sources  with  various  ener¬ 
gies  of  impinging  photons.  The  laser-assisted 
growth  technique  was  instrumental  in  aiding  our 
understanding  of  the  complex  surface  chemistry 


which  was  occuring  that  limits  the  growth  when 
using  particular  MO  sources.  The  ll-VI  effort  is 
complemented  by  a  new  research  program  with  a 
research  goal  of  fabricating  lattice-matched  epitaxial 
buffer  layers  of  (ln,Ga)P  (section  4.3).  The  recently 
functioning  lll-V  GSMBE  is  also  being  utilized  to 
begin  the  investigation  of  very  sophisticated 
(ln,Ga)(As,P)  waveguide  devices  for  application  as 
tunable  filters  operating  at  1.55  (xm  which  is  the 
wavelength  of  interest  for  optical  fiber  communi¬ 
cation  (section  4.4). 


3.2  PhotO‘Assisted  Metalorganic 
Molecular  Beam  Epitaxy  of  ZnSe  Using 
a  Combination  of  Solid  and  Gaseous 
Sources 

Sponsors 

3M  Company  Faculty  Development  Grant 
Defense  Advanced  Research  Projects  Agency 
Subcontracts  216-25013  and  542383 
Joint  Services  Electronics  Program 
Cor  tract  DAAL03-92-C-0001 
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The  use  of  photon  illumination  during  the  epitaxial 
growth  of  compound  semiconductors  enhances  the 
capabilities  of  conventional  growth  techniques  by 
modifying  the  surface  chemistry  and  thus  providing 
a  potential  tool  necessary  for  in-situ  selective  area 
epitaxy.  Photo-assisteo  epitaxy  iios  been  reported 
for  the  growth  of  both  narrow  and  wide  bandgap 
ll-VI  materials,  as  well  as  for  a  variety  of  lll-V  semi¬ 
conductors.  The  effect  of  photons  on  tt'e  growth  is 
dependent  on  the  precursor  selected  for  constituent 
materials  (and/or  the  type  of  growth  technology 
employed),  as  well  as  the  energy  and  flux  of  illumi¬ 
nating  photons. 

The  various  physical  mechanisms  w'  ich  have  been 
either  reported  or  verified  experimentally  to  explain 
the  effect  of  photon  illumination  include;  (1)  selec¬ 
tive  desorption  of  adsorbed  surface  species,  (2) 
pyrolysis  of  source  precursors  at  the  surface  via 


'  R.M.  Park,  M.B.  Trotfer,  C.M.  Rouleau,  J.M.  Depuydt,  and  M.A.  Masse,  "P-Type  ZnSe  by  Nitrogen  Atom  Beam  Doping  during  Molec¬ 
ular  Beam  Epitaxial  Growth."  Appl.  Phys.  Lett.  57:  2127-2129  (1990). 

2  M.A.  Masse,  J.  Qui,  J.M.  Depuydt,  and  H.  Cheng,  "Blue-Green  Laser  Diodes,"  Appl.  Phys  Lett.  59:  1272-1274  (1991). 

3  M.  Jeon,  J.  Ding,  W.  Patterson,  A.V,  Nurmikko,  W.  Xie,  D.C.  Grillo,  M.  Kobayashi,  and  R.L.  Gunshor,  "Blue-green  Injection  Laser 
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increases  in  the  suustrate  temperature,  (3)  direct 
photolysis  of  source  precursors  which  are  present 
in  the  gas  phase,  and  (4)  photo-catalysis  of  molec¬ 
ular  species  through  the  creation  of  free  carriers 
present  at  the  surface.  Similar  effects  on  the  epi¬ 
taxial  growth  process  have  been  reported  when  the 
surface  is  illuminated  with  an  electron  beam.  In  the 
case  of  an  electron  beam,  the  physical  mechanisms 
which  have  been  postulated  include  kinetically- 
induced  decomposition  of  adsorbed  precursor  mole¬ 
cules  and/or  the  presence  of  charge  carriers  which 
interact  with  surface  species. 

In  this  program,  the  effect  of  "beam-assisted" 
growth  during  the  metalorganic  molecular  beam 
epitaxy  (MOMBE)  of  ZnSe  is  under  investigation. 
By  utilizing  the  wavelength-dependent  laser- 
assisted  growth  technique  and  a  combination  of 
both  gaseous  metalorganic  and  solid  elemental 
sources  during  the  growth,  we  were  able  to  identify 
the  dominant  surface  reactions  which  result  in  sig¬ 
nificant  enhancements  in  the  growth  rate,  as  well  as 
a  growth  rate  suppression. 

ZnSe  films  were  grown  by  MOMBE  on  GaAs(001) 
substrates.  The  Zn  and  Se  p.ecursors  included 
elemental  (6N)  Zn  and  Se,  diethylzinc  (DEZn), 
dimethylzinc  (DMZn),  and  diethylselenium  (DESe). 
DEZn  and  DESe  were  initially  chosen  due  to  their 
lower  thermal  decomposition  energies  to  enable  the 
low  temperature  growth  of  ZnSe.  The  DEZn  was 
typically  utilized  as  a  metalorganic  gas,  i.e.  not 
thermally  decomposed,  with  quadrupole  mass  spec¬ 
troscopy  of  the  gas  beam  indicating  that  thermal 
decomposition  occurs  near  300°C.  In  all  cases,  the 
DESe  molecule  was  thermally  decomposed  at 
800®C  prior  to  impingement  onto  the  substrate 
surface.  The  calibrated  substrate  temperature 
ranged  from  150-475°C.  Photo-assisted  MOMBE 
was  performed  by  illuminating  a  portion  of  the 
wafer’s  surface  with  laser  radiation.  The  emission 
'ines  of  an  argon  ion  laser,  a  Ti:sappnire  laser 
(7800  A),  and  a  dye  (Coumarin  7)  laser  were  all 
used  to  investigate  the  wavelength  dependence  of 
the  photo-assisted  growth.  The  power  density  of 
the  lasers  was  kept  very  low  (<200  mW/cm^),  elimi¬ 
nating  the  possibility  of  photon-induced  thermal 
pyrolysis  of  the  metalorganic  precursors. 

The  unilluminated  growth  rate  using  DEZn  and 
DESe  precursors  was  found  *o  be  abnormally  low 
(several  hundred  angstroms  per  hour)  for  variations 
in  the  following  growth  parameters:  substrate  tem¬ 
perature,  gas  flow  rate,  the  Vl/ll  flow  rate  ratio,  the 


addition  oi  nydrogen  supplied  by  a  gas  injector,  and 
the  thermal  pyrolysis  of  each  gaseous  species 
using  a  "cracking"  gas  injector.  The  growth 
parameter-independent  low  growth  rate  suggested 
that  the  growth  rate  was  limited  not  by  mass  trans¬ 
port  to  the  substrate  surface,  but  rather  that  the  Zn 
and  Se  atoms  were  unable  to  incorporate  into  the 
crystalline  lattice.  However,  we  found  that  illumi¬ 
nation  with  photons  having  energy  greater  than  the 
energy  bandgap  of  the  ZnSe,  was  found  to  signif¬ 
icantly  Increase  the  growth  rate  under  certain  condi¬ 
tions.  To  verify  that  the  resultant  low  growth  rate 
and  high  illuminated  growth  rate  was  unique  to  the 
use  of  diethyl-based  metalorganic  sources,  ZnSe 
was  grown  with  all  combinations  of  DEZn,  DMZn, 
DESe,  and  elemental  Zn  and  Se.  Growth  rates 
obtained  using  atomic  Zn  or  DMZn  and  elemental 
Se  were  lypical  of  that  achievable  with  molecular 
beam  epitaxy  and  limited  only  by  mass  transport  to 
the  substrate  surface;  however,  when  at  least  one 
diethyl  metalorganic  was  used,  the  resultant  unillu¬ 
minated  growth  rate  was  very  low.  The  exper¬ 
imental  evidence  thus  suggested  that  the  presence 
of  ethyl  radicals  (or  a  byproduct  of  diethyl  molecular 
decomposition)  was  responsibie  for  the  low  unillu¬ 
minated  growth  rate.  The  ethyl  radicals  are 
believed  to  be  strongly  chemisorbed  to  the  ZnSe 
surface  such  that  the  incorporation  sites  for  the 
metal  atoms  were  not  available.  Relatively  high 
substrate  temperatures  were  investigated  (up  to 
475°C)  in  an  attempt  to  desorb  the  ethyl  species 
responsible  for  surface  site  saturation.  The  meas¬ 
ured  growth  rate  was  extremely  low,  -100  A/hour. 
We  speculate  that  the  desorption  rate  of  the  ethyl 
species  was  not  significantly  modified  at  the  high 
substrate  temperatures,  whereas  the  desorption  of 
Zn  and  Se  became  significant  and  dominated  the 
growth.  Work  reported  in  the  literature  describing 
the  chemical  beam  epitaxy  of  GaAs  by  Murrell  et 
al.®  has  indicated  that  a  similar  surface  site 
blockage  phenomenon  was  speculated  to  occur  at 
relatively  low  substrate  temperatures  for  GaAs. 

The  generation  of  free  carriers  at  the  growth  front 
during  MOMBE  growth  has  been  found  to  alleviate 
the  site  blockage  phenomenon  encountered  using 
DEZn  and  DESe  sources.  In  addition,  new  degrees 
of  freedom  are  provided  for  the  selection  of  the 
growth  conditions.  Depending  on  the  degree  of 
pyrolysis  of  the  Zn  source,  the  application  of  a  laser 
was  found  to  induce  either  an  enhancement  or  a 
suppression  of  the  growth  rate.  The  influence  of 
the  laser  on  the  growth  rate  for  the  various  combi¬ 
nations  of  sources  employed  is  summarized  in  table 


5  A.J.  Murrell,  A.T.S.  Wee,  D.H.  Fairbrother,  N.K.  Singh,  J.S.  Foord,  G.J.  Davies,  and  D.A.  Andrews,  "Surface  Chemical  Processes  in 
Metal  Organic  Molecular-Beam  Epitaxy:  Ga  Deposition  from  Triethylgallium  on  GaAs  (100),”  J.  Appl.  Phys.  68;  4053-4063  (1990). 
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Source  of  Zn 

Source  of  Se 

MOMBE 

Growth  Rate* 

Effect  of 
Ulumination  on 
Growth  Rate 

DEZn 

DESe 

No  Growth 

None 

DEZn 

cracked  DESe 

Low 

Enhancement 

DEZn 

elemental  Se 

Low 

Enhancement 

elemental  Zn 

cracked  DESe 

Low 

Suppression 

cracked  DEZn 

cracked  DESe 

Low 

Suppression 

1  ■  ■  1 

1  elemental  Zn  elemental  Se  High  (MBE) 

Suppression  { 

—z:— _  i 

cracked  DMZn 

cracked  DESe 

Low 

Enhancement 

cracked  DMZn 

elemental  Se 

High 

Suppression  or 
Enhancement 

•  Low;  100  -  400  A/hr.  High:  typical  of  MBE 


Table  1 .  Comparison  of  the  growth  rate  of  ZnSe  obtained  for  various  source  materials  and  with  application  of  photons 
having  energy  sufficient  to  generate  carriers. 


1.  An  enhanced  growth  rate  was  observed  for 
growth  conditions  which  resulted  in  partially 
pyrolyzed  DEZn  at  the  surface.  Thernnal  decom¬ 
position  of  the  DEZn  prior  to  delivery  to  the  sub¬ 
strate  surface  resulted  in  the  expected  low 
unilluminated  growth  rate  (as  shown  in  table  1). 
However,  in  stark  contrast  to  the  results  described 
for  the  unpyrolyzed  DEZn  molecule,  a  suppression 
of  the  illuminated  growth  rate  was  observed.  From 
these  qualitative  observations,  it  is  clear  that  the 
surface  chemistry  which  dominated  the  laser- 
assisted  MOMBE  growth  was  the  conversion  of  par¬ 
tially  pyrolyzed  DEZn  into  atomic  Zn  which  leads  to 
incorporation  into  the  lattice.  When  using  DMZn 
and  solid  Se,  we  have  measured  both  growth  rate 
enhancement  and  suppression  depending  on  the 
growth  conditions  which  result  in  various  surface 
stoichiometries.  A  laser-assisted  effect  was  also 
observed  when  a  more  "MBE-like"  growth  approach 
was  used,  i.e.,  both  metalorganic  sources  were 
cracked  and/or  solid  sources  were  employed.  In 
these  cases,  however,  the  illuminated  growth  rate 
was  measurably  suppressed  and  was  in  agreement 
with  earlier  reports  of  laser-assisted  MBE  growth  of 
ZnSe.®  We  attributed  the  growth  rate  reduction  to 
the  desorption  of  excess  Se  on  the  surface.  Vari¬ 
ations  of  the  energy  of  the  impinging  photons  in  the 
"MBE-like"  growth  approach  also  confirmed  that  the 
desorption  phenomenon  was  related  to  the  genera¬ 


tion  of  electron/hole  pairs  and  the  presence  of  free 
carriers  at  the  growth  front. 

Low  temperature  photoluminescence  (PL)  was  used 
to  assess  the  optical  properties  of  the  thin  ZnSe 
films.  Ultraviolet  radiation  from  a  He-Cd  laser  was 
used  as  the  excitation  source  while  keeping  the 
sample  temperature  near  1 0K.  Figure  2  shows  the 
PL  obtained  from  the  illuminated  (photons  having 
K- 4579  A)  and  unilluminated  regions  of  the  film 
when  cracked  DESe  and  uncracked  DEZn  were 
used  as  the  source  materials.  The  PL  data  demon¬ 
strates  the  effect  that  the  laser  has  on  the  growth. 
The  thin  ZnSe  film,  which  was  not  illuminated  by 
the  laser  (upper  curve),  is  dominated  by  deep  level 
defects  suggesting  a  highly  nonstoichiometric  layer. 
In  contrast,  the  film  grown  with  photon  illumination 
(lower  curve)  shows  practically  no  deep  level 
luminescence  and  a  remarkably  more  intense  near 
bandedge  feature,  most  likely  a  bound  exciton  with 
an  energy  of  2.796  eV.  Identification  of  the  impurity 
related  to  the  formation  of  the  exciton  is  compli¬ 
cated  by  the  presence  of  strain  and  requires  addi¬ 
tional  investigation.  Figure  3  shows  the  PL  for  a 
film  grown  with  solid  Zn  and  cracked  DESe,  at  a 
calibrated  substrate  temperature  of  230°C.  Com¬ 
parison  of  the  region  illuminated  by  the  laser  (lower 
curve)  to  the  unilluminated  region  (upper  curve)  is 
as  expected  for  an  "MBE-like"  growth  approach.  In 


6  R.L.  Gunshor,  L.A.  Kolodziejski,  A.V.  Nurmikko,  and  N.  Otsuka,  'Molecular  Beam  Epitaxy  of  ll-VI  Semiconductor  Microstructures,'  in 
Semiconductors  and  SemiMelals,  ed.  T.P.  Pearsall  (Boston:  Academic  Press,  1990). 
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particular,  the  growth  rate  is  not  increased,  but 
slightly  decreased  when  illuminated  with  photons. 
The  resultant  effect  of  the  laser  illumination  is  to 
tune  the  surface  stoichiometry.  The  PL  reflects  the 
tuning  of  the  stoichiometry  by  eliminating  the  weak 
donor-to-acceptor  pair  band  luminescence  which 
may  be  attributed  to  electron-to-acceptor-like 
vacancy  recombination.  The  photoluminescence 
measurements  provide  additional  information  sug¬ 
gesting  that  the  photon-assisted  growth  affects  pri¬ 
marily  the  metalorganic  Zn  molecule  and 
contributes  to  converting  the  DEZn  into  metallic  Zn 
allowing  incorporation  into  the  crystalline  lattice. 


Figure  2.  Photoluminescence  of  a  ZnSe  film  grown 
using  DEZn  and  DESe  at  a  calibrated  substrate  temper¬ 
ature  of  310°C  (a)  without  laser  illumination  and  (b)  with 
X- 4579  A.  Insert  shows  the  energy  of  the  dominant 
feature.  Flow  rates  of  0.5  and  2.5  seem  were  used  for 
the  DEZn  and  DESe,  respectively. 


3.3  Epitaxial  Lattice-Matched  lll-V 
Buffer  Layers  for  Wide  Bandgap  ll-VI 
Visible  Sources 
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Figure  3.  Photoluminescence  of  a  ZnSe  film  grown 
using  Zn  and  DESe  (a)  without  laser  illumination  and  (b) 
with  X  -  4579  A.  Inserts  show  the  energy  of  the  dominant 
feature. 

National  Science  Foundation 
Grant  DMR  92-02957 

To  take  advantage  of  the  numerous  opportunities 
offered  by  long-lived,  continuously  operating,  room 
temperature  visible  sources  based  on  the  wide 
bandgap  ll-VI  semiconductors,  several  key  techno¬ 
logical  barriers  must  be  overcome.  One  very 
important  difficulty  is  the  lack  of  high  quality  ll-VI 
substrate  material.  This  parti'^ular  limitation  creates 
the  need  for  the  heteroepitaxy  of  ll-VI  layers  onto 
lll-V  based  substrates.  If  bulk  substrates  are 
employed,  the  only  choice  is  to  use  GaAs  which 
requires  assessment  of  the  effects  of  lattice  mis¬ 
match  (0.2C.o)  strain  on  the  resultant  light  emitting 
device  (LED)  stmetures.  An  additional  consider¬ 
ation  is  the  need  for  electronic  confinement.  Thus, 
the  use  of  bandgap  engineering  and  application  of 
quantum  wells  is  necessary  for  the  design  of  the 
laser  or  LED  device.  Therefore,  various  ll-VI 
ternary  and  quaternary  layers  are  currently  under 
investigation,  with  the  resulting  epilayer  having  a 
significantly  varying  lattice  parameter.  Issues  of 
strain  between  both  the  substrate  material  and  ll-VI 
based  active  layer  and  between  the  well  (narrow 
bandgap)  and  barrier  (wider  bandgap)  ll-VI  alloy 
layers  are  crucial.  The  presense  of  strain  contrib- 
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utes  to  the  generation  and  propagation  of  defects 
which  will  ultimately  affect  the  lifetime  of  any 
emitter. 

Our  interconnected  epitaxy  system  provides  the 
unique  ability  to  investigate  the  growth  of  ll-VI 
quantum  well  structures  on  lll-V  epitaxial  surfaces 
where  the  lattice  parameter  ranges  from  that  of 
GaP  to  InAs.  These  material  systems  cover  the 
same  lattice  parameter  range  as  Zn(S,Se,Te)  and 
CdS,  and  all  alloy  combinations.  The  shaded 
region  in  figure  1  shows  the  range  of  lattice  con¬ 
stants  and  energy  bandgaps  that  can  be  grown  in 
the  CBE  facility.  We  are  presently  investigating  the 
growth  of  ZnSe  on  lattice-matched  epitaxial 
(ln,Ga)P  buffer  layers.  As  a  first  step,  the  growth 
conditions  required  to  lattice  match  the  (ln,Ga)P  to 
ZnSe  are  being  determined.  The  epitaxial  films  are 
examined  using  double  crystal  x-ray  diffraction  and 
low  temperature  (10  K)  photoluminescence.  To 
date,  the  full  width  at  half  maximum  (FWHM)  of  the 
(ln,Ga)P  peak  (as  determined  by  x-ray  diffraction 
rocking  curves)  has  been  measured  to  be  34  arc 
seconds,  whereas  the  GaAs  substrate  peak  had  a 
FWHM  of  26  arc  seconds.  The  photoluminescence 
spectra  of  the  (ln,Ga)P  consists  of  a  single  peak 
corresponding  to  the  near  bandedge  of  the  alloy. 
These  (ln,Ga)P  films  are  typically  grown  at  a  sub¬ 
strate  temperature  of  470°C  and  with  a  growth  rate 
between  0.25  and  1  pm  per  hour.  Initial  optical 
microscopy  observations  suggest  that  the  starting 
surface  morphology  is  featureless  (unlike  the  cross- 
hatched  surface  of  similarly  strained  (ln,Ga)As 
which  is  typically  used  for  buffer  layer?  to  ZnSe) 
and  holds  promise  for  the  minimization  of  defect 
propagation  into  the  active  ll-VI  device  layers  and 
hence,  enhancement  of  the  performance  of  ll-VI 
based  optical  sources. 


3.4  Fabrication  of  Novel  "Channel 
Dropping  Filters"  for  Wavelength 
Division  Multiplexed  Photonic  System 
Applications 

Sponsor 

Defense  Advanced  Research  Projects  Agency 
Subcontract  542383 

The  fabrication  of  novel  "channel  dropping  filters" 
for  wavelength  division  multiplexed  photonic  system 
applications  is  currently  underway.  A  channel  drop¬ 
ping  filter  consists  of  three  parallel,  but  coupled  res¬ 
onant  waveguides,  fabricated  from  (ln,Ga)(As,P)  to 
take  advantage  of  the  potential  integration  with  the 
existing  technology  that  is  used  in  optical  communi¬ 
cation  systems.  The  various  (ln,Ga)(As,P)  layers 


will  be  lattice-matched  to  InP  substrates,  but 
depending  on  the  In  and  As  mole  fractions,  will 
have  various  energy  gaps  and  hence  various 
indices  of  refraction.  We  are  presently  conducting 
experiments  to  determine  the  optimum  growth  con¬ 
ditions  for  the  individual  (ln,Ga)(As,P)  layers  using 
in-situ  reflection  high  energy  electron  diffraction,  as 
well  as  ex-situ  double  crystal  x-ray  diffraction  and 
photoluminescence.  In  addition,  device  simulations 
are  underway  to  optimize  the  filter  performance  in 
order  to  determine  (1)  the  appropriate  mole  frac¬ 
tions  of  quaternary  layers  necessary  to  vertically 
confine  the  optical  signal,  (2)  the  dimensions 
required  to  achieve  the  lateral  waveguiding,  and  (3) 
the  necessary  coupling  coefficient  between  the 
various  waveguides.  In  this  research  program,  a 
multi-investigator  approach  is  required.  Here,  we 
will  pursue  the  growth  of  the  layered  waveguide 
structure:  the  necessary  gratings  will  be  etched  in 
the  Submicron  Structures  Laboratory  under  the 
direction  of  Professor  H.l.  Smith  (see  part  I,  section 
2,  chapter  4),  and  the  theory,  device  simulation, 
and  measurements  of  filter  performance  are  being 
carried  out  in  the  Optics  and  Devices  Group  with 
Professors  H.A.  Haus  and  E.P.  Ippen  (see  part  I, 
section  3,  chapter  1). 
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4.1  Introduction 

The  focused  ion  beam  research  program  at  MIT 
has  been  mainly  aimed  at  developing  new  applica¬ 
tions.  Our  work  can  be  divided  into  two  areas;  The 
first,  served  by  a  1 50  kV  system,  is  the  high  energy 
regime  and  includes  implantation  and  lithography. 
The  second  is  the  lower  energy  regime  and  is 
aimed  at  developing  repair  processes  for  masks 
and  integrated  circuits.  It  is  served  by  two 
machines,  a  50  kV  system  mounted  on  a  home 
built  UHV  chamber  and  an  FEl  500D  system  which 
was  donated  by  the  FEl  Company  to  MIT  last  year. 
This  system  produces  Ga*  ions  between  5  and  25 
keV  at  minimum  beam  diameters  of  0.05  pm. 

The  high  energy  system  includes  automated  pat¬ 
terning  capability  over  wafers  up  to  six  inches  in 
diameter.  Alignment  of  the  focused  ion  beam 
writing  to  within  ±0.1  pm  of  existing  features  on  a 
wafer  has  been  demonstrated.  Software  has  been 
developed  which  permits  patterns  to  be  transferred 
from  the  layout  system  used  in  the  Microsystems 
Technologies  Laboratory  to  the  focused  ion  beam 
machine.  Accordingly,  this  permits  flexible,  mixed 
fabrication  where  standard  steps  have  been  carried 
out  in  the  integrated  circuits  laboratory  at  MIT,  and 
special  implantation  or  lithography  steps  have  been 
carried  out  on  the  focused  ion  beam  system. 
Similar  mixed  fabrication  has  also  been  carried  out 
on  Si  wafers  partly  fabricated  at  Ford  Aerospace 
and  on  GaAs  wafers  at  Raytheon  Research  Labora¬ 
tory.  The  ion  species  available  for  implantation 
include  the  principal  dopants  of  GaAs  and  Si.  The 


minimum  beam  diameter  available  is  on  the  order 
of  0.1  pm  at  an  ion  current  of  20  pA.  In  many  of 
the  implantation  projects  where  the  minimum  diam¬ 
eter  is  not  needed,  a  higher  beam  current  can  be 
used. 

In  the  lower  energy  regime  our  work  has  mainly 
focused  on  the  development  of  ion  induced  deposi¬ 
tion.  This  technique  uses  a  local  ambient  of  a  pre¬ 
cursor  gas,  usually  organometallic  or  metal  halide, 
to  permit  deposition  to  be  carried  out  with  minimum 
linewidth  of  0.1  mm.  The  local  patterned  deposition 
complements  material  removal  by  ion  milling  and  is 
used  to  add  missing  absorber  material  in  the  repair 
of  photomasks  and  x-ray  lithography  masks  or  to 
rewire  local  connections  in  integrated  circuits.  Our 
efforts  have  focused  on  (1)  gold,  tungsten,  and 
copper  deposition,  (2)  understanding  the  fundamen¬ 
tals  of  the  process,  and  (3)  deposition  at  non¬ 
normal  incidence.  Ga*  ions  in  the  range  of  10-50 
kV  are  mainly  used. 

4.2  Tunable  Gunn  Diodes  - 
Optimization  for  Microwave 
Applications 

Sponsor 
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A.  Antoniadis,  Dr.  John  Meingailis 

Tunable  Gunn  diodes  are  two-terminal  voltage  con¬ 
trolled  oscillators  (VCOs).  They  are  fabricated  by- 
using  a  focused  ion  beam  to  implant  a  gradient  of 
doping  in  the  direction  of  current  flow.  The  best 
performance  we  have  achieved  so  far  is  tunability 
over  20  GHz  with  a  maximum  frequency  of  42  GHz. 
To  optimize  the  performance,  we  have  recently 
implanted  a  large  array  of  devices  (over  3000) 
varying  the  dose,  geometry,  and  gradient  as  para¬ 
meters.  An  immediate  application  of  these  simple 
VCOs  is  the  built-in-test  for  monolithic  microwave 
integrated  circuits  (MMICs).  A  tunable  Gunn  diode 
is  coupled  to  the  input,  and  a  diode  detector  is 
coupled  to  the  output.  By  switching  these  two  ele¬ 
ments  into  the  circuit  and  applying  a  ramp  bias 
signal  to  the  Gunn  diode,  the  circuit  is  tested  by 
varying  the  frequency  of  the  input  signal  over  its 
entire  range  of  operation. 


4.3  Dose  Rate  Effects  in  the 
Implantation  of  Si  in  GaAs 

Sponsor 
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The  instantaneous  current  density  of  a  focused  ion 
beam  is  of  order  0.1  to  1  A/cm^.  In  normal,  broad 
beam  implantation  the  current  density  is  4  to  6 
orders  of  magnitude  lower.  We  have  shown  that 
this  high  instantaneous  current  density  can  lead  to 
increased  lattice  damage  in  some  cases  of  focused 
ion  beam  implantation.  One  way  to  mitigate  these 
effects  is  to  scan  the  beam  over  the  sample  so  as 
to  present  a  lower  average  current  density.  We 
have  found  surprising  effects  of  a  large  variations  in 
lattice  damage  as  a  function  of  the  scan  rate  of  the 
focused  ion  beam.  At  a  given  point  on  the  sub¬ 
strate  the  scan  schedule  produces  a  pulsed  implant 


of  a  selected  dwell  time  which  is  repeated  after 
some  period.  We  have  measured  the  damage 
using  Rutherford  backscattering  as  well  as  electrical 
properties.  At  a  given  dwell  time,  for  example,  the 
amount  of  damage  decreases  for  repetition  periods 
up  to  times  of  order  seconds.  This  implies  that, 
contrary  to  one's  intuition,  after  an  ion  (Si)  has  pen¬ 
etrated  into  the  solid  (GaAs),  it  does  not  come  into 
equilibrium  even  after  one  second.  In  general,  the 
lattice  would  be  expected  to  be  back  in  equilibrium 
in  times  of  order  nanoseconds. 


4.4  Limited  Lateral  Straggle  of 
Focused-lon-Beam  Implants 

Sponsor 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Army  Research  Office 
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Project  Staff 
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The  lateral  straggle  of  ions  as  they  penetrate  into  a 
substrate  limits  the  minimum  lateral  dimensions  of 
implant  profiles  that  can  be  defined  in  a  substrate. 
This  potential  spreading  of  the  profile  has  to  be 
considered  to  fabricate  devices  at  ultrasmall  dimen¬ 
sions  or  confined  carrier  structures  for  quantum 
effect  studies.  The  lateral  distribution  of  focused- 
ion-beam  implanted  Si  and  Be  atoms  has  been 
studied  by  measuring  the  electrical  resistivity  in 
grating  structures.  The  gratings  which  were  ori¬ 
ented  perpendicular  to  the  direction  of  the  current 
flow  were  implanted  with  silicon  and  beryllium  at 
280  and  260  keV  respectively.  They  were 
implanted  into  semi-insulating  materials  cut  on-  and 
off-axis,  and  then  repaid-thermal-annealed.  The 
lateral  straggle  was  found  to  be  less  than  100  nm 
for  Si  and  equal  to  190  nm  for  the  Be  implants. 
The  standard  deviation  of  the  lateral  distribution 
was  found  to  increase  with  the  dose.  This  is  attri¬ 
buted  to  a  concentration-dependent  diffusion  which 
results  in  an  anomalously  high  diffusion  coefficient. 
Comparison  of  the  experimental  parameters  of  the 
implanted  distribution  with  values  found  in  standard 
tables  or  calculated  by  a  Monte-Carlo  TRIM  code 
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seems  to  indicate  that  all  simulations  overestimate 
lateral  straggle  at  the  expense  of  penetration  depth. 


4.5  Focused  Ion  Beam  Implantation  in 
GaAs  for  Transistor  Optimization 

Sponsor 

National  Science  Foundation 
Grant  ECS  89-21728 

Project  Staff 

T.E.  Kazior/  M.H.  Cobb/  Sergey  Etchin,  Dr.  John 
Meingailis 

Focused  ion  beam  implantation  is  a  useful  tech¬ 
nique  for  optimizing  channel  implants  in  integrated 
circuits.  The  fact  that  the  implants  are  done  in  a 
maskless,  direct-write  fashion  permits  the  dose,  ion 
energy,  and  ion  species  to  be  varied  from  device  to 
device  on  the  same  wafer.  Wafer  to  wafer  vari¬ 
ations  do  not  obscure  interpretation  of  the  data.  To 
demonstrate  this,  we  have  implanted  the  Be  buried 
p-layer  beneath  the  active  Si-implanted  channel  in  a 
GaAs  MESFET.  The  Be  and  Si  implant  conditions 
were  varied  over  a  range  of  energies  and  doses. 
The  optimum  performance  is  expected  for  implant 
conditions  which  yielded  the  steepest  gradient  in 
the  tail  of  the  electron  concentration  profile  without 
producing  conducting  holes.  All  implants  were  per¬ 
formed  through  a  50  nm  SiN*  film  and  rapid  thermal 
annealed  at  900°C  for  10  seconds.  The  transistor 
gates  were  0.25  nm  long  with  a  T-shaped  profile. 
Within  the  range  of  Be  implant  conditions  explored, 
the  gradient  in  the  tail  of  the  electron  distribution 
was  strongly  dependent  on  the  dose,  but  did  not 
vary  systematically  with  changes  in  energy.  The 
experiments  carried  out  on  a  single  wafer  indicate 
that  a  large  process  window  exists  for  the  Be 
implant  parameters  and  this  results  in  an  improved 
10  GHz  noise  performance.  This  type  of  optimiza¬ 
tion  with  conventional  broad  beam  implants  would 
require  an  unacceptable  number  of  wafers  and 
would  be  complicated  by  wafer  to  wafer  variations. 

These  results  also  show  that  focused  ion  beam 
implantation  could  be  useful  as  a  production  tech¬ 
nique  if  a  given  integrated  circuit  required  a  large 
variety  of  implants,  each  one  over  a  limited  area. 
In  conventional  fabrication,  each  implant  requires  a 
complete  sequence  of  steps  (resist  spinning,  expo¬ 
sure,  development,  broad  beam  implantation,  and 
resist  stripping).  These  steps  are  eliminated  by  the 


focused  ion  beam  implant  and,  in  some  circum¬ 
stances,  may  justify  the  cost  in  implantation  time. 

4.6  Confined  Carrier  Distributions  in 
lll-V  Semiconductors  Created  by 
Focused  Ion  Beams 

Sponsor 
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One  way  to  obtain  confined  carrier  distributions  for 
quantum  effect  studies  is  to  use  focused  ion  beam 
implantation  damage.  In  particular,  the  insulating 
region  produced  by  the  damage  can  be  used  to 
carve  up  the  two-dimensional  electron  gas  pro¬ 
duced  by  modulation  doping.  One  of  the  effects 
which  may  limit  the  resolution  of  this  technique  is 
the  depletion  of  carriers  around  the  implanted 
region.  We  are  attempting  to  calculate  the  effect  of 
this  depletion  by  using  analogies  to  the  widely 
studied  and  modeled  Si/SiOz  interface  in  MOS 
structures.  The  width  of  the  depletion  will  also  be 
measured  by  writing  different  geometries  to  create 
planar  capacitors  and  transistors  in  GaAs/AIGaAs 
mesas. 


4.7  Focused  Ion  Beam  Lithography  for 
X-ray  Mask  Making 

Sponsor 
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Project  Staff 
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Making  an  x-ray  mask  is  a  challenging  task.  The 
absorber  material  on  the  thin  mask  membrane  must 
be  fabricated  of  high  Z  material  (e.g.,  Au  or  W)  with 
high  aspect  ratio,  (e.g.,  0.25  to  0.5  pm  thick  at 
minimum  dimensions  down  to  0.1  pm).  Two  tech¬ 
niques  for  writing  the  original  pattern  of  the  mask  in 
resist  are  electron  beam  lithography  and  focused 
ion  beam  lithography.  While  electron  beams  have 
been  extensively  developed  and  applied,  ion  beams 
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have  been  used  to  write  patterns  down  to  0.015  pm 
minimum  linewidth  and  show  no  proximity  effects 
even  for  resists  over  high  Z  material.  The  resist 
exposure  times  for  the  two  techniques  are  compa¬ 
rable.  Previously,  we  had  fabricated  x-ray  masks  at 
MIT  with  minimum  linewidth  of  0.05  pm  in  absorber 
0.2  pm  thick.  One  goal  of  this  program  is  to  fabri¬ 
cate  x-ray  masks  for  use  at  the  Center  for  X-ray 
Lithography  at  the  University  of  Wisconsin.  The 
absorber  material  must  be  0.5  pm  thick  because  of 
the  higher  energy  of  the  x-ray  photons  used.  Thus 
the  resist  (PMMA)  thickness  in  this  case  has  to  be 
of  order  0.6  pm,  and  the  lithography  is  more  chal¬ 
lenging  to  accomplish.  The  masks  are  fabricated 
by  exposing  the  PMMA  over  a  thin  gold  plating 
base,  writing  the  pattern  with  either  Be**  or  Si**  ions 
at  260  keV,  developing  the  resist,  and  plating  up 
the  gold  absorber  features.  We  have  determined 
the  optimum  ion  exposure  dose,  developing  condi¬ 
tions  and  plating  conditions.  In  test  structures  we 
have  succeeded  in  producing  gold  features  with  0.1 
pm  lines  0.5  pm  high.  In  the  Prometrix  test  pattern 
which  was  successfully  delivered  to  the  University 
of  Wisconsin  the  minimum  lines  written  were  0.2 
pm  wide. 

4.8  Focused  Ion  Beam  Induced 
Deposition  of  Copper 

Sponsors 
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Focused  ion  induced  deposition  is  used  in  the 
repair  of  integrated  circuits  and  masks.  For  the 
repair  of  circuits,  a  low  resistivity  of  the  deposited 
material  is  desirable.  In  the  deposition  process  a 
local  ambient  of  an  organometallic  gas  is  formed  on 
the  surface  where  the  ion  beam  is  incident.  The 
incident  ions  cause  the  adsorbed  molecules  to  be 
decomposed,  leaving  the  metal,  and  usually  some 
carbon,  deposited  on  the  surface.  Typically  the 
resistivity  of  the  film  is  in  the  100-1000  pQcm 
range,  compared  to  1-10  pQcm  for  pure  metals. 
We  have  achieved  the  first  focused  ion  beam  depo¬ 
sition  of  copper  from  a  novel  organometallic  pre¬ 
cursor  gas,  Cu  (hfac)  TMVS.  Ga*  ions  at  35  keV 


were  used.  Resistivity  and  yield  have  been  meas¬ 
ured  as  a  function  of  temperature  and  average  ion 
current  density.  Submicron  copper  lines  deposited 
at  room  temperature  from  this  precursor  exhibit 
resistivities  as  low  as  70  i^Qcm;  a  sharp  drop  in 
these  values  is  noted  for  deposition  at  70°C,  and 
deposition  on  a  substrate  heated  above  about 
100°C  yields  resistivities  near  those  of  pure  bulk 
copper.  Composition  analysis  by  Auger  Electron 
Spectroscopy  shows  the  high  temperature  deposi¬ 
tion  to  be  nearly  pure  copper.  Deposition  yields  of 
25  copper  atoms  per  incident  Ga*  ion  have  been 
obtained  on  both  silicon  and  silicon  dioxide  sub¬ 
strates,  with  growth  rates  of  up  to  1 3  A  per  second 
at  an  average  ion  current  density  of  200  nA/cm^. 
The  microstructure  of  the  film  will  be  examined  by 
scanning  electron  microscopy  (SEM)  and  trans¬ 
mission  electron  microscopy  (TEM).  The  precursor 
shows  special  promise  for  the  deposition  of  low 
resistivity  submicrometer  interconnects  on 
integrated  circuits. 

4.9  Focused  Ion  Beam  Induced 
Deposition  and  Ion  Milling  as  a 
Function  of  Angle  of  Ion  Incidence 
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In  the  repair  of  integrated  circuits,  x-ray  masks 
focused  ion  beam  induced  deposition  and  ion 
milling  often  have  to  be  performed  over  quite 
nonplanar  topography.  Thus,  the  milling  and  the 
deposition  as  a  function  of  the  angle  of  ion  inci¬ 
dence  are  important.  The  milling  yield  of  Si,  SiOz, 
Au,  and  W  versus  angle  of  incidence  using  25  keV 
Ga*  ions  has  been  measured.  In  qualitative  agree¬ 
ment  with  simulations,  the  yield  rises  with  angle  and 
then  falls  as  grazing  incidence  is  approached. 
Deposition  yield  versus  angle  was  measured  using 
dimethylgold  hexafluoro-acetylacetonate  and  W 
(COje  as  the  precursor  gases.  The  measurements 
were  carried  out  using  cylindrical  quartz  fibers 
30-50  ufn  in  diameter  which  automatically  provide  a 
range  of  angles.  Rippling  of  the  deposited  material 
is  observed  at  angles  of  incidence  greater  than  50°. 
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1.1  Project  Description 

Sponsor 
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We  are  continuing  to  study  equilibrium  properties  of 
quantum  dots — isolated  metallic  systems  with  sizes 
smaller  than  a  typical  length  of  an  electron 
dephasing.  In  contrast  with  the  transport  properties 
of  these  systems,  their  thermodynamics  was  rela¬ 
tively  neglected  by  both  theorists  and 
experimentalists. 

We  are  investigating  theoretically  the  dielectric 
properties  of  quantum  dots.  We  have  considered  a 
neutral  grain  with  a  positive  uniform  background 
and  conduction  electrons  which  are  scattered  by 
disordered  impurity  centers.  Due  to  the  fact  that  the 
grain  is  disordered,  one  should  expect  a  nonuniform 
distribution  of  charge  inside  the  grain.  The  electric 
field  outside  a  grain  is  determined  by  the  charge 
distribution  inside  the  sample.  Therefore,  in  prin¬ 
ciple,  it  is  possible  to  measure  electric  field  outside 
the  grain,  although  the  grains  net  charge  is  equal  to 
zero.  This  means  that  a  neutral  quantum  dot 
without  any  external  electric  field  possesses  a  finite 
dipole  moment  as  well  as  higher-order  multipole 
moments. 

We  have  calculated  the  mean  square  electric 
potential  far  from  the  grain  surface  and  determined 
its  temperature  and  magnetic  field  dependence.' 
We  have  also  considered  the  response  of  quantum 
dot  to  an  external  electric  field  and  determined 


mesoscopic  (sample  to  sample)  fluctuations  of  the 
electric  polarizability.^  The  zero  field  (spontaneous) 
dipole  moment,  as  well  as  the  polarizability, 
depends  significantly  on  the  temperature  and  the 
magnetic  field  manifesting  the  quantum  nature  of 
the  phenomena.  Due  to  screening  of  the  charge 
fluctuations  in  the  bulk  of  the  grain,  both  effects  are 
rather  weak,  so  that  only  the  surface  charge  fluctu¬ 
ations  are  important.  Nevertheless,  for  systems  with 
relatively  low-electron  density  (such  as  semiconduc¬ 
tors),  both  effects  are  well  within  current  exper¬ 
imental  capabilities. 

Modern  experimental  technique  allows  us  to  resolve 
single-electron  energy  levels  in  a  quantum  dot.®  In 
most  cases  the  electrons  withir  a  dot  are  scattered 
irregularly  by  impurities  and  the  surface.  Even  for  a 
"clean"  quantum  dot  with  a  size  much  smaller  than 
the  mean  free  path,  the  motion  of  electrons  can  be 
chaotic  rather  than  integrate  due  to  the  particular 
shape  of  the  grain.  The  electron  energy  spectrum  of 
these  chaotic  systems  turns  out  to  be  rather  irreg¬ 
ular. 

We  have  started  to  develop  a  statistical  approach  to 
the  description  of  chaotic  energy  spectra  in  the 
presence  of  external  fields.  First,  we  have  consid¬ 
ered  a  disordered  mesoscopic  ring  with  Aharonov- 
Bohm  flux  O.  The  energy  of  a  given  electron  state 
£„  (we  will  measure  £„  in  units  of  mean  energy  level 
spacing  A)  is  a  periodic  function  of  <!>  with  a  period 
4>o-hc/e.  At  the  same  time,  on  the  interval 
0 < 4>< <I>o.  this  function  Ea(<I>)  can  be  considered  as 
a  random  one.  We  have  found  that  the  autocor¬ 
relation  function  of  the  derivatives  dEa/d<I>  (i.e.,  of 
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the  currents  carried  by  the  level  a  at  different  systems  including,  for  example,  Rydberg  atoms  in 
fluxes)  is  determined  as  an  external  magnetic  field. 


d<Di  d<I> 


1.2  Publications 


In  the  interval  g’^Oo  <  <  4>o  (where  g  >  >  1  is  the 

conductance  of  the  system  in  units  e^/h  ),  this  cor¬ 
relation  function  is  universal,  i.e.,  independent  on 
the  disorder  and  particular  characteristics  of  the 
quantum  dot; 


This  result,  though  derived  for  the  quantum  dots 
with  sufficiently  strong  disorder,  is  probably  also 
valid  for  clean  quantum  dots  with  chaotic  shape  and 
perhaps  also  for  much  broader  classes  of  quantum 
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Several  years  ago  we  discovered^  that  while  con¬ 
ventional  field-effect  transistors  turn  from  the  off- 
state  to  the  on-state  only  once  as  electrons  are 
added  to  them,  a  very  small  transistor  turns  on  and 
then  off  every  time  one  electron  is  added.  The 
near-term  goal  of  our  research  is  to  understand  why 
the  single  electron  transistor  works  the  way  it  does. 
Our  long-term  goal  is  to  make  the  device  character¬ 
istics  useful  for  applications. 

A  sketch  of  our  first  single  electron  transistor^  is 
shown  in  figure  la.  We  begin  wi*h  a  crystal  of 
GaAs  which  has  such  a  high  density  of  impurities 
that  it  IS  metallic.  This  plays  the  role  of  the  gate. 
On  top  of  this  we  grow  a  layer  of  AIGaAs,  which  is 
an  insulator,  and  a  layer  of  pure  GaAs,  where  the 
electrons  reside.  If  a  positive  voltage  is  applied 
between  the  gate  and  the  source  or  drain,  electrons 
accumulate  at  the  AIGaAs/GaAs  interface. 
Because  of  the  strong  electric  field  at  this  interface. 


each  electron's  energy  for  motion  perpendicular  to 
the  interface  is  quantized,  and  at  low  temperatures 
the  electrons  move  only  in  the  two  dimensions  par¬ 
allel  to  the  interface.  The  special  trick  that  makes 
this  a  single  electron  transistor  is  the  creation  by 
electron  beam  lithography  of  a  pair  of  electrodes  on 
the  tc,";  surface  of  the  GaAs.  When  negative 
voltage  is  applied  between  these  and  the  source  or 
drain,  the  electrons  are  repelled  and  cannot  accu¬ 
mulate  underneath  the  electrodes.  Consequently, 
the  electrons  are  confined  in  a  narrow  channel 
between  the  two  electrodes.  Constrictions  sticking 
out  into  the  channel  reoel  the  electrons  and  create 
potential  barriers  at  either  end  of  the  channel.  A 
plot  of  the  potontial  is  shown  in  figure  1b.  For  an 
electron  to  travel  from  the  source  to  the  drain,  it 
must  tunnel  through  these  barriers.  A  "pool"  of 
electrons  accumulates  between  the  two  con¬ 
strictions,  separated  from  the  leads  by  the  tunnel 
barriers. 

Figure  2  shows  the  current  through  the  device'  as  a 
function  of  the  voltage  Vy  between  the  gate  and  the 
source.  A  very  small  amount  of  voltage  is  applied 
between  the  two  leads,  which  is  just  large  enough 
to  measure  the  tunneling  conductance  between 
source  and  drain.  The  results  are  astounding.  The 
conductance  displays  sharp  resonances  that  are 
almost  periodic  in  Vg.  By  calculating  the  capaci¬ 
tance  between  the  pool  of  electrons  and  the  gate. 
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Figure  la.  (a)  Schematic  drawing  of  the  single  electron 
transistor. 


we  can  show^  that  the  period  is  the  voltage  neces¬ 
sary  to  add  one  electron  to  the  pool. 

There  is  a  simple  way  to  explain  the  periodic  con¬ 
ductance  resonances;®  the  theory  is  called  the 
Coulomb  blockade  model.  When  an  electron 
tunnels  from  one  lead  onto  the  pool  of  electrons 
and  then  onto  the  other  lead,  adding  a  charge  Q  to 
the  pool  requires  energy  Q^/aC,  where  C  is  the  total 
capacitance  between  the  pool  and  the  rest  of  the 
system:  since  you  can  add  no  less  than  one  elec¬ 
tron,  the  flow  of  current  requires  a  Coulomb  energy 
e^/aC.  A  fancier  way  to  explain  this  is  that  because 
of  charge  quantization  there  is  an  energy  gap  in  the 
spectrum  of  states  for  tunneling;  For  an  electron  to 
tunnel  onto  the  pool  its  energy  must  exceed  the 
Fermi  energy  of  the  contact  by  e^/2C,  and  for  a 
hole  to  tunnel  its  energy  must  be  below  the  Fermi 
energy  by  the  same  amount.  Consequently,  the 
energy  gap  has  width  eVC.  If  the  temperature  is 
low  enough  that  kT  <  eV2C,  neither  electrons  nor 
holes  can  flow  from  one  lead  to  the  other. 

With  the  gate  voltage  Vg  we  can  alter  the  energy 
required  to  add  charge  to  the  pool.  Vg  is  applied 
between  the  gate  and  source,  but  if  the  drain- 
source  voltage  is  very  small,  the  source,  drain  and 
pool  are  all  at  almost  the  same  potential.  With  Vg 


Figure  1b.  (b)  Potential  similar  to  the  or.-  ■ ,  .-  a  single 
electron  transistor  plotted  as  a  function  of  position  at  the 
GaAs/AIGaAs  interface. 


applied  the  electrostatic  energy  of  charge  Q  on  the 
pool  is 

E  •  QVg Q^/2C.  (1). 

The  first  term  is  the  attractive  interaction  between 
the  positively  charged  gate  electrode  and  the  nega¬ 
tive  charge  Q,  and  the  second  term  is  the  repulsive 
interaction  among  the  bits  of  charge  in  the  pool. 
Equation  1  shows  that  the  energy  as  a  function  of 
Q  is  a  parabola  with  minimum  at  Qo=  -  CVg. 

By  varying  Vg,  we  can  choose  any  value  of  Qo,  the 
charge  that  would  minimize  the  energy  in  equation 
1  if  the  charge  were  not  quantized.  However, 
because  the  real  charge  is  quantized,  only  discrete 
values  of  the  energy  E  are  possible.  For  all  values 
of  Qo  except  Qo=  -  (N  1/2)e  ,  there  is  a  non-zero 
energy  for  adding  or  subtracting  an  electron.  Under 
these  circumstances,  no  current  can  flow  at  low 
temperature.  However,  if  Qo=  -  (N  1/2)e,  the 
state  with  Q  =  -  Ne  and  that  with  Q  =  -  (N  -Hl)e  are 
degenerate;  the  charge  fluctuates  between  the  two 
values  even  at  zero  temperature.  Consequently, 
the  energy  gap  in  the  tunneling  spectrum  disap¬ 
pears.  and  current  can  flow.  The  peaks  in  conduc¬ 
tance  are,  therefore,  periodic,  occurring  whenever 
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Figure  2.  Conductance  in  units  of  the  quantum  of  con¬ 
ductance  eVh  versus  voltage  on  the  n'GaAs  substrate. 
On  the  logarithmic  plot,  it  is  clear  that  the  transistor  has 
an  on-to-off  conductance  ratio  that  exceeds  10^  for  the 
lowest  gate  voltages. 

the  average  charge  on  the  artificial  atorn  is 
Qo=  -  (N  +i/2)e,  spaced  in  gate  voltage  by  e/C. 

The  Coulomb  blockade  model  accounts  for  charge 
quantization  but  ignores  the  quantization  of  energy 
resulting  from  the  small  size  of  the  region  in  which 
the  electrons  are  confined.  If  one  thinks  of  the 
region  between  the  potential  barriers  as  a  box,  the 
lowest  energy  spacings  are  of  the  order 
where  a  is  the  size  of  the  box.  Because  the  energy 
as  well  as  the  charge  is  quantized,  we  find  it  useful 
to  think  of  the  single  electron  transistor  as  an  artifi¬ 
cial  atom. 

The  energy  level  spectrum  of  the  atom  can  be 
measured  directly  by  observing  the  tunneling 
current  at  fixed  Vg  as  a  function  of  the  voltage  (Vgs) 
between  drain  and  source.  The  Fermi  level  in  the 
source  rises  in  proportion  to  V^s  relative  to  the 
drain,  so  it  also  rises  relative  to  the  energy  levels  of 
the  artificial  atom  (see  the  inset  to  figure  3). 
Current  begins  to  flow  when  the  Fermi  energy  of 
the  source  is  raised  just  above  the  first  quantized 
energy  level  of  the  atom.  As  the  Fermi  energy  is 


raised  further,  higher  energy  levels  in  the  atom  fall 
below  the  Fermi  energy,  and  more  current  flows 
because  there  are  additional  channels  for  the  elec¬ 
tron  to  use  for  tunneling  onto  the  artificial  atom. 
We  measure  the  energies  by  measuring  the  voltage 
at  which  the  current  increases,  or,  equivalently,  the 
voltage  at  which  there  is  a  peak  in  the  derivative  of 
the  current  dl/dVgs.  We  show  results  of  such  a 
measuremenf  in  the  upper  part  of  figure  3. 

Increasing  the  gate  voltage  lowers  all  the  energy 
levels  in  the  atom  by  -  eVg,  so  that  the  entire  tun¬ 
neling  spect'um  shifts  with  Vg.  This  effect  can  be 
observed  by  plotting  the  values  of  Vgs  at  which 
peaks  appear  in  dl/dVgs,  as  is  done  in  the  lower 
part  of  figure  3.  You  can  see  the  gap  in  the  tun¬ 
neling  spectrum  shift  lower  with  increasing  Vg  and 
then  disappear  at  the  charge-degeneracy  point,  just 
as  predicted  by  the  Coulomb  blockade  model.  You 
can  also  see  the  discrete  energy  levels  of  the  artifi¬ 
cial  atom.  The  charge-degeneracy  points  are  the 
values  of  Vg,  for  which  one  of  the  energy  levels  of 
the  artificial  atom  is  degenerate  with  the  Fermi 
energy  in  the  leads  when  Vds  =  0,  because  only 
then  can  the  charge  of  the  atom  fluctuate. 

During  the  past  year,  we  have  developed  a  thor¬ 
ough  understanding  of  the  energy  spectrum  of 
single  electron  transistors.  In  addition,  we  now 
know  what  limits  their  operating  temperature.  As 
shown  in  figure  3,  the  devices  we  have  made  so  far 
have  a  Coulomb-blockade  gap  that  is  about  1  meV 
wide  and  energy  level  spacings  that  are  about  one- 
fifth  to  one-tenth  of  that.  As  the  temperature  is 
raised,  the  conductance  resonances  acquire  a  width 
of  -  4  kT.  Since  kT  =  1  meV  is  equivalent  to  T  =  12 
K,  the  peak-to-valley  ratio  becomes  small  above  a 
few  degrees  K.  Making  the  single  electron  tran¬ 
sistor  useful  will  require  making  its  operating  tem¬ 
perature  much  higher.  This  requires  decreasing  the 
size  because  both  the  Coulomb  energy,  which 
varies  inversely  with  the  capacitance,  and  the 
energy  level  spacings  will  get  larger  for  smaller 
structures.  This  is  the  direction  of  our  next 
research  effort. 


7  E.B,  Foxman,  P.L.  McEuen,  U.  Meirav,  N.S.  Wingreen,  Y.  Meir,  P.A.  Belk,  N.R.  Belk,  M.A.  Kastner,  and  S.J.  Wind,  "The  Effects  of 
Quantum  Levels  on  Transport  Through  a  Coulomb  Island,"  unpublished:  M.A.  Kastner,  "Artificial  Atoms,"  Phys.  Today,  46(1):  24-31. 
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transistor  can  be  detected  by  varying  the  drain-source 
voltage.  When  a  targe  enough  is  applied,  electrons 
overcome  the  energy  gap  and  tunnel  from  the  source  to 
the  pool  of  electrons  (see  inset  of  upper  panel).  Upper 
panel:  Every  time  a  new  discrete  state  is  accessible,  the 
tunneling  current  increases,  giving  a  peak  in  dl/dVa,.  The 
Coulomb  blockade  gap  is  the  region  between  about  - 
O.SmV  and  +  0.3mV  where  there  are  no  peaks.  Lower 
panel:  Plotting  the  positions  of  these  peaks  at  various 
gate  voltages  gives  the  level  spectrum.  Note  how  the 
levels  and  the  gap  move  downward  as  V,  increases. 


2.2  Publications 

Bagwell,  P.F.,  S.L  Park,  A.  Yen,  D.A.  Anloniadis, 
H.l.  Smith,  T.P.  Orlando,  and  M.A.  Kastner. 
"Magnetotransport  in  Multiple  Narrow  Silicon 
Inversion  Channels  Opened  Electrostatically  into 
a  Two-dimensional  Electron  Gas."  Phys.  Rev.  B 
45(16):  9214-9221  (1992). 

Kastner,  M.A.  "The  Single-Electron  Transistor." 
Rev.  Mod.  Phys.  64(3):  849-858  (1992). 

Kastner,  M.A.,  E.B.  Foxman,  P.L.  McEuen,  U. 
Meirav,  A.  Kumar,  and  S.J.  Wind.  "Transport 
Spectroscopy  of  a  Coulomb  Island."  In  Nano¬ 
structures  and  Mesoscopic  Systems.  Eds.  W.P. 
Kirk  and  M.A.  Reed.  San  Diego,  California: 
Academic  Press,  1992. 

McEuen,  P.L.,  E.B.  Foxman,  J.  Kinaret,  U.  Meirav, 
M.A.  Kastner,  N.S.  Wingreen,  and  S.J.  Wind. 
"Self-consistent  Addition  Spectrum  of  a  Coulomb 
Island  in  the  Quantum  Hall  Regime."  Phys. 
Rev.  0  45(19):  11419-11422  (1992). 


44 


RLE  Progress  Report  Number  135 


Chapter  3.  Differential  Resistance  Through  a  Quantum  Dot 


Chapter  3.  Differential  Resistance  Through  a  Quantum  Dot: 
Signature  of  Kondo  Correlation 

Academic  and  Research  Staff 

Professor  Patrick  A.  Lee 

Graduate  Students 

Jari  M.  Kinaret,  Dmitri  B.  Chkiovskii 

Technical  and  Support  Staff 

Imadiel  Ariel 


3.1  Project  Description 
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The  behavior  of  an  atomic  impurity  coupled  to  con¬ 
duction  electrons  has  become  one  of  the  paradigms 
of  condensed  matter  physics.  Competition  between 
on-site  Coulomb  interaction  and  band  hybridization 
produces  the  Kondo  effect:  a  crossover  from  weak 
to  strong  coupling  between  the  localized  and  band 
electrons  below  the  Kondo  temperature,  Tk.  The 
study  of  the  Kondo  effect  has  been  limited, 
however,  by  the  nature  of  the  impurity  system. 
Since  it  is  a  daunting  task  to  drive  the  host  metal 
out  of  equilibrium,  it  is  the  equilibrium  properties  of 
Kondo  impurities  that  have  been  explored. 

In  the  study  of  transport  through  a  quantum  dot,  we 
have  a  new  Kondo  system  in  which 
non-equilibrium— a  semiconductor  quantum  dot 
weakly  coupled  to  its  leads— is  routinely  achieved. 
Anderson's  model  for  a  Kondo  impurity — discrete, 
interacting  levels  coupled  to  a  band-— also  describes 
quantum  dots.  The  discrete  spectrum  of  a  single 
dot  has  been  probed  experimentally  by  transport 
and  capacitance  spectroscopy,  and  the  strong 
on-site  Coulomb  interaction  is  obsen/ed  in 


Coulomb-blockade  conductance  oscillations.' 
Anderson's  model  has  provided  an  excellent  theore¬ 
tical  description  of  these  experiments.^  However,  it 
is  only  the  high  temperature  regime  that  has  been 
explored  experimentally,  while  it  is  below  Tk  that 
the  Kondo  effect  emerges. 

We  have  previously  shown  that  below  Tk,  the 
Kondo  resonance  leads  to  perfect  transparency  of 
the  quantum  dot  at  the  Fermi  energy.  This  leads  to 
dramatic  effects  on  the  lineshape  of  the  conduc¬ 
tance  peaks  as  a  function  of  gate  voltage.^*  We 
recently  realized  that  a  striking  signature  on  the 
Kondo  correlation  appears  in  nonequilibrium  proper¬ 
ties,  such  as  the  differential  conductance,  even  at  a 
temperature  higher  than  Tk.^  The  requirement  is 
that  the  temperature  must  be  less  than  r,  the 
intrinsic  line  width  of  the  resonant  transmission 
peak.  This  latter  condition  is  much  easier  to  realize 
experimentally.  We  have  shown  that  if  the  gate 
voltage  is  set  so  that  we  are  on  the  shoulder  of  a 
conductance  peak  and  the  source  drain  voltage  Vsd 
is  increased,  the  differential  conductance  would 
exhibit  a  peak  around  Vsd=0.  Furthermore,  if  a 
magnetic  field  B  is  applied,  the  differential  conduc¬ 
tance  peak  will  split  into  two  peaks  centered  at  the 
Zeeman  energy  ±  g^BB.  We  believe  that  the  obser¬ 
vation  of  this  conductance  peak  is  a  clean  signature 
of  the  Kondo  correlation  in  the  quantum  dot  system. 
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4.1  Submicron  Structures  Laboratory 

The  Submicron  Structures  Laboratory  at  MIT 
develops  techniques  for  fabricating  surface  struc¬ 
tures  with  feature  sizes  in  the  range  from  nano¬ 
meters  to  micrometers  and  uses  these  structures  in 
a  variety  of  research  projects.  These  projects, 
described  briefly  below,  fall  into  four  major  catego¬ 
ries;  (1)  development  of  submicron  and  nanometer 
fabrication  technology;  (2)  nanometer  and  quantum- 
effect  electronics;  (3)  crystalline  films  on  non-lattice¬ 
matching  substrates;  and  (4)  periodic  structures  for 
x-ray  optics,  spectroscopy  and  atomic  interferom¬ 
etry. 


4.2  X-Ray  Nanolithography 
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A  large  fraction  of  the  applications  pursued  by  the 
Submicron  Structures  Laboratory  require  a  reliable 
lithographic  technique  capable  of  producing 
sub-100  nm  features  with  broad  process  latitude 
and  high  throughput.  Ideally,  the  technique  should 
be  simple,  low-cost,  and  enable  students  to  do  their 
own  lithography  locally.  For  this  reason,  we  have 
been  developing  x-ray  nanolithography,  exploring  its 
limits,  and  making  its  various  components  (e.g., 
mask  making,  resists,  electroplating,  sources,  align¬ 
ment,  etc.)  reliable  and  "user  friendly."  Because  of 
the  critical  importance  of  x-ray  mask  technology,  we 
discuss  this  in  a  separate  section  (4.3). 


1  University  of  Cairo,  Giza,  Egypt,  and  IBM  Corporation,  Thomas  J.  Watson  Research  Center,  Yorktown  Heights,  New  York. 
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Our  sources  for  x-ray  nanolithography  are  simple, 
low-cost  electron-bombardment  targets,  typically 
Cul(X=  1 .32  nm),  separated  by  1 .4  pm-thick  SiN, 
vacuum  windows  from  the  helium-filled  exposure 
chambers.  In  1993  we  will  greatly  increase  our 
throughput  with  a  Hampshire  Instruments  3500X 
laser-plasma  x-ray  source  and  exposure  system. 
The  3500X  will  be  located  in  a  dedicated  room 
adjacent  to  the  Integrated  Circuits  Laboratory. 

We  have  developed  an  apparatus  for  aligning  an 
x-ray  mask  to  substrates  in  X,  Y  and  6  and  control¬ 
ling  the  mask-sample  gap.  Substrates  ranging  from 
10  cm-diameter  Si  wafers  to  1x1  cm  GaAs  chips 
are  handled. 

In  earlier  research  we  showed  that  for  x-ray  wave¬ 
lengths  longer  than  -  1  nm,  the  range  of  the 
photoelectron  emitted  when  an  x  ray  is  absorbed  in 
resist  does  not  limit  resolution.  Down  to  feature 
sizes  ~  20  nm,  diffraction  is  the  only  concern.  By 
means  of  accurate  electromagnetic  calculations, 
taking  into  account  the  vectorial  character  of  the 
electromagnetic  field  and  the  dielectric  properties  of 
the  absorber,  we  have  shown  that  when  source 
spatial  coherence  is  optimized,  diffraction  does  not 
limit  resolution  as  severely  as  had  been  predicted 
by  simple  Fresnel  diffraction  calculations.  The 
results  are  summarized  in  figure  1. 


Minimum  llnewldth  (nm) 


Figure  1.  Plot  of  image  contrast  for  a  grating  pattern  of 
equal  lines  and  spaces  as  a  function  of  linewidth,  W. 
The  parameter  a  is  related  to  gap  G  and  wavelength 
X  via  G  =aWVX.  p  is  the  ratio  of  penumbral  edge  blur  to 
linewidth  p  =  6/W. 

For  the  linewidth  range  from  70  to  20  nm,  mask- 
substrate  gaps  must  be  below  5  pm.  This  is  not  a 
problem  in  research,  but  in  manufacturing  it  may  be 
unacceptable.  For  this  reason  we  are  investigating 
the  feasibility  of  using  arrays  of  zone  plates  for 
projection  imaging  with  x  rays  of  either  4.5  nm  or  ~ 
1 .2  nm  wavelength. 


Figure  2  shows  the  result  of  two  aligned  exposures 
with  x-ray  nanolithography.  This  quantum-effect 
device  is  an  array  of  quasi-one-dimensional  con¬ 
ductors  with  a  crossing  tunnel  gate. 

In  the  nanolithography  range,  the  final  profile 
obtained  in  resist  depends  on  both  the  aerial  image 
and  the  development  characteristics  of  the  resist. 
We  have  developed  a  model  and  simulation 
program  that  enable  us  to  predict  resist  profiles 
given  the  input  parameters.  We  are  setting  up  an 
apparatus  for  checking  experimentally  the  simu¬ 
lation. 


Quasi-1D  PRESTFET  Device 


Figure  2.  Scanning  electron  micrograph  of  a  quantum- 
effect  device  structure  fabricated  using  two  separate, 
aligned  x-ray  nanolithography  exposures.  The  master 
x-ray  masks  were  fabricated  at  the  Naval  Research  Labo¬ 
ratory  (NRL).  The  device  is  an  array  of  quantum  wires 
with  tunneling  barriers  in  the  middle. 
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At  feature  sizes  of  1 00  nm  and  below,  the  mask-to- 
sample  gap,  G,  must  be  precisely  controlled.  The 
mesa  mask  shown  schematically  in  figure  3  yields 
mask  membranes  that  are  flat  to  <  250  nm.  We 
are  improving  this  technology  with  the  flip-bonded 
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Figure  3.  Mesa  style  mask  suitable  for  x  ray  lithography  at  feature  sizes  from  500  to  20  nm.  Membranes  are  flat  to 
<  250  nm. 


scheme  shown  schematically  in  figure  4.  With  this 
technique,  using  polysilicon  as  the  anodic  bonding 
agent  in  place  of  Ni,  we  anticipate  that  x-ray  mask 
blanks  can  be  fabricated  entirely  with  automated 
integrated  circuit  (1C)  processing  equipment. 

Our  mask  technology  is  based  on  low-stress,  Si-rich 
silicon  nitride,  SiN,.  This  material  is  now  produced 
in  the  1C  Laboratory  at  MIT  in  a  new,  dedicated  ver¬ 
tical  LPCVD  reactor.  The  resulting  films  are  clean 
and  uniform,  and  x-ray  mask  membranes  made 
from  them  are  extremely  robust.  They  can  be 
cleaned  and  processed  in  conventional  stations. 
Radiation  hardness  remains  a  problem  at  dose 
levels  corresponding  to  production  (i.e.,  millions  of 


exposures)  but  for  research  the  material  is  excel¬ 
lent. 

For  absorber  patterns  we  use  both  gold,  Au,  and 
tungsten,  W.  Both  can  be  obtained  with  near-zero 
stress  (i.e.,  <  10  MPa)  which  implies  that  pattern 
distortion  should  be  negligible  (i.e.,  <  1  nm).  The 
gold  is  electroplated  onto  the  membrane  after  resist 
exposure  and  development  using  a  specially 
designed  apparatus.  The  W  is  sputter  deposited 
and  patterned  by  reactive-ion  etching.  We  verify 
the  achievement  of  stress  <  10  MPa  using  a  Linnik 
interferometer,  which  we  have  recently  equipped 
with  a  CCD,  a  frame  grabber,  and  special  purpose 
software  that  improves  sensitivity. 
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Figure  4.  Flip-bonded  mask  structure  in  which  the  musa  rim  is  formed  in  the  pyrex  ring.  Membrane  flatness  is  <  100 
nm. 


In  order  to  check  that  zero  stress,  as  determined  by 
Linnik  interferometry,  corresponds  to  zero  in-plane 
distortion,  we  have  developed  a  technique  of  holo¬ 
graphic  interferometry  (HI)  shown  schematically  in 
figure  5.  We  have  shown  a  good  correlation 
between  Linnik  and  HI  measurements,  but  we 
intend  to  further  improve  the  sensitivity  of  the  in¬ 
plane  distortion  measurements. 

Patterning  of  x-ray  masks  is  done  by  holographic 
lithography  for  periodic  structures  but,  for  patterns 
of  arbitrary  geometry,  is  done  by  e-beam  lithog¬ 
raphy,  in  collaboration  with  the  Naval  Research 
Laboratory  (NRL)  and  IBM.  We  use  CAD  tools  at 
MIT  to  convert  the  data  into  formats  compatible  with 
the  e-beam  exposure  systems.  Data  is  shipped  to 
NRL  by  electronic  mail  or  IBM  by  express  mail 
along  with  x  ray  mask  blanks  already  coated  with 
e-beam  resist.  After  e-beam  exposure,  the  masks 
are  shipped  back  to  MIT  by  express  mail  where 
development  and  Au  electroplating  are  carried  out. 
This  collaboration  has  already  demonstrated  that 
x-ray  mask  patterning  by  e-beam  can  be  done 
remotely,  so  that  university  researchers  with  limited 
facilities  can  have  access  to  nanolithography  via  x 
ray  alone  without  owning  or  visiting  an  e-beam 
lithography  facility. 

For  the  production  of  x-ray  masks  with  W  absorber, 
a  reactive-ion-etching  process  is  required,  which 
puts  considerable  power  into  the  membrane  sub¬ 
strate.  Since  membranes  have  very  low  thermal 
mass  and  conductivity,  we  must  heat  sink  the  back 
side.  We  are  constructing  a  new  reactive-ion¬ 
etching  system  that  will  provide  back-side  He 
cooling.  The  cooling  can  be  to  temperatures  as  low 


as  -40°C.  At  such  low  temperatures  the  isotropic 
etching  component  is  suppressed  and  high  resol¬ 
ution  directional  etching  on  membranes  should  be 
possible. 


Figure  5.  Holographic  interferometer  for  detecting  in¬ 
plane  distortion  in  x-ray  mask  membranes. 
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Conventional  scanning-electron-beam  lithography 
(SEBL)  depends  upon  laser  interferometry  to 
achieve  geometric  integrity,  such  as  axis 
orthogonality,  accurate  length  scale,  placement 
accuracy,  etc.  However,  laser  interferometry 
cannot  solve  all  such  geometric  difficulties,  in  part 
because  the  e-beam  location  can  drift  due  to 
thermai  expansion,  charging,  and  many  other 
causes.  This  difficulty  with  conventional  SEBL  is 
particularly  troublesome  in  the  fabrication  of  opto¬ 
electronic  devices  which  require  spatial-phase 
fidelity  over  areas  many  times  larger  than  a  single 
scan  field. 

We  have  proposed  a  scheme  called  spatial-phase¬ 
locking  in  order  to  achieve  high  fidelity  in  SEBL.  In 
this  scheme,  a  spatially-coherent  grating  or  grid, 
created  by  holographic  lithography,  is  located  on 
the  substrate  or,  in  the  case  of  x-ray  mask  mem¬ 
branes,  on  a  second  substrate  below.  All  e-beam 
writing  is  then  done  with  reference  to  this  grid  using 
phase  locking  techniques.  Preliminary  work  has 
been  done  in  collaboration  with  IBM  Corporation  at 
Yorktown  Heights,  New  York.  Whereas  in  conven¬ 
tional  SEBL  stitching  errors  at  the  boundary  of  two 
fields  is  typically  several  times  10  nm,  we  achieved 
an  error  below  2  nm.  We  plan  to  expand  this 
activity,  and,  in  the  long  term,  we  expect  to  provide 
an  entirely  new  paradigm  for  SEBL,  which  will  also 
translate  into  lower  capital  costs. 

In  order  to  better  predict  the  optimal  SEBL  expo¬ 
sure  conditions,  particularly  when  writing  on  x-ray 
mask  membranes,  we  have  implemented  a  Monte 
Carlo  model  for  electron  scattering  and  energy  loss. 
A  model  developed  earlier  by  Hawryluk  et  al.  was 
modified  to  include  the  excitation  of  secondary  elec¬ 
trons.  The  new  model  was  employed  successfully 
to  show  that  the  reduced  proximity  effect  observed 
by  Rhee  et  al.  when  SEBL  was  done  over  a  thin 
Si02  layer  on  top  of  tungsten  was  due  to  the  SiOa 
stopping  the  energetic  secondary  electrons  from  the 
W. 

The  new  model  has  also  been  used  to  validate  the 
experimentally  observed  reduction  of  the  delete¬ 
rious  proximity  effects  due  to  backscattered  elec¬ 
trons  when  SEBL  is  done  on  x-ray  mask 
membranes  rather  than  solid  substrates.  Figure  6 


shows  a  simulation  of  resist  development  following 
e-beam  exposure  on  (a)  a  membrane  and  (b)  a 
solid  Si  substrate. 


4.5  A  High-Precision  Mask  Alignment 
Scheme 
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In  order  for  any  lithographic  technique  to  be  of 
general  utility,  a  compatible  alignment  technique 
must  also  be  provided.  This  technique  must  be 
capable  of  superposition  precision  that  is  a  small 
fraction  of  the  minimum  feature  size. 

We  have  demonstrated  a  novel  alignment  scheme 
that  builds  on  the  principles  of  an  interferometric 
scheme  we  invented  several  years  ago,  but  over¬ 
comes  its  shortcomings.  Alignment  marks  are 
simple  linear  gratings  with  spatial  periods  around  1 
micron  so  that  the  diffraction  angle  for  visible  light  is 
around  30  degrees.  The  spatial  periods  of  the 
mask  and  substrate  marks  differ  by  a  few  percent 
so  that  beams  that  are  diffracted  back  onto  the 
optical  axis  produce  a  moire  pattern  that  can  be 
imaged  on  a  CCD.  The  mask  alignment  mark  also 
contains  a  fixed  fiducial  grating  whose  spatial 
period  matches  that  of  the  moir§  pattern.  Align¬ 
ment  is  defined  as  that  condition  wherein  the  spatial 
phase  of  the  moir§  (which  moves  as  the  mask  is 
piezoelectrically  displaced  relative  to  the  substrate) 
matches  the  spatial  phase  of  the  fiducial  grating. 

Using  this  scheme,  a  superposition  precision  with  1 
o  equal  to  6  nm  was  achieved,  as  shown  in  figure 
7.  Analysis  has  shown  that  the  theoretical  limit  is 
about  0.6  nm.  Clearly,  this  would  be  compatible 
with  minimum  lithographic  features  ~  20  nm,  which 
we  consider  the  practical  limit  of  resist-based  lithog¬ 
raphy. 

We  plan  to  work  with  Hampshire  Instruments  to 
develop  a  commercial  version  of  this  alignment 
scheme  and  install  it  in  our  3500X  x-ray  lithography 
system. 


*  Hampshire  Instruments,  Inc.,  Rochester,  New  York. 
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exposed  at  twice  the  clearing  dose,  (a)  (top)  For  a  1  pm-thick  Si  membrane  substrate,  (b)  (bottom)  For  a  solid  sub¬ 


strate.  Successive  contours  represent  30  sec  o1  additional  development.  Note  the  reduction  ol  proximity  effects  for 


exposure  on  a  membrane  substrate. 
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Figure  7.  Histogram  of  20  successive  aligned  x-ray 
exposures,  showing  o  =  6  nm.  Offset  of  peak  is  not  sig¬ 
nificant.  A  moire  method  was  used  to  measure  misalign¬ 
ment. 


4.6  Achromatic  Holographic 
Lithography 
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Holographic  schemes  are  preferred  for  the  fabri¬ 
cation  of  periodic  and  quasi-periodic  patterns  that 
must  be  spatially  coherent  over  large  areas  and 
free  of  phase  steps.  For  spatial  periods  below  200 
nm  light  sources  with  wavelengths  below  200  nm 
must  be  used.  All  such  sources  have  limited 
temporal  coherence,  and  thus  one  is  forced  to 
employ  achromatic  schemes  such  as  shown  in 
figure  8.  Using  this  apparatus,  we  have  achieved 
gratings  of  100  nm  period  (i.e.,  50  nm  lines  and 


spaces).  In  order  to  make  the  apparatus  more  reli¬ 
able  the  depth-of-focus  has  to  be  increased,  which 
implies  an  improvement  of  the  source  spatial  coher¬ 
ence.  To  this  end,  we  have  introduced  a 
collimating  lens  and  a  slit  scanning  system.  In 
order  to  enhance  yield  and  area,  it  will  be  neces¬ 
sary  to  find  the  substrate  location  that  provides 
maximum  image  contrast.  Therefore,  we  have  set 
up  a  white-light  interferometer  that  utilizes  optical 
paths  through  the  quartz  plates  that  closely  approxi¬ 
mate  the  paths  of  the  exposing  beams.  A 
photodetector  and  lock-in  amplifier  will  be  used  to 
find  the  optimum  sample  position. 

Once  the  system  is  fully  reliable,  we  plan  to  take 
the  technique  one  step  further  to  50  nm  periods  (25 
nm  lines  and  spaces)  using  a  13  nm  undulator  as 
the  light  source. 

4.7  Fabrication  of  T-gate  Devices  using 
X-ray  Lithography 

Sponsor 
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High-speed  MODFET  devices  require  very  short 
gate  lengths  while  preserving  low  resistance.  Large 
gate  widths  are  required  for  high  current  drive.  To 
meet  these  conflicting  demands,  researchers  have 
developed  so-called  "T-gate"  and  "gamma-gate" 
processes  in  which  the  base  of  the  gate  is  very 
short  (-  100  nm)  while  the  upper  part  is  large  and 
overlaps  the  short  base,  like  a  mushroom  or  the 
letters  T  or  gamma.  Such  structures  are  readily 
achieved  using  direct-write  electron-beam  lithog¬ 
raphy.  However,  this  technology  is  expensive, 
slow,  and  probably  incompatible  with  mass  pro¬ 
duction. 

For  these  reasons,  we  are  developing  an  x-ray 
lithography-based  approach  to  making  Tee  and 
gamma  gate  devices.  Several  schemes  are  being 
pursued.  The  most  straightforward  involves  a  tri¬ 
layer  resist.  The  first  layer  is  an  x-ray  sensitive 
resist  (e.g.,  PMMA  or  a  chemically  amplified 
novolak),  the  second  is  a  barrier  layer  (e.g., 
photosensitive  polyimide)  between  the  x-ray  resist 
and  the  upper  layer,  which  is  photoresist.  The 
upper  layer  and  the  polyimide  are  exposed  with  UV 
light.  This  tri-level  process  provides  maximum  flexi¬ 
bility  and  a  large  process  window,  but  requires  two 
separate  alignments  and  exposures.  On  the  other 
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Achromatic  Holographic 
Lithography 


sin9  =  X/(nP), 


A  =  A/(2n  sin0) 
=  P/2 


Figure  8.  Achromatic  holographic  lithography  configuration  for  generating  100-nm-period  gratings. 

hand,  the  photolithography  does  not  have  to  be 
aligned  as  precisely  as  the  x-ray  exposure.  Other 
approaches  will  also  be  investigated  in  which,  for 
example,  three  different  x-ray  resist  sensitivities  are 
employed  and  only  one  alignment  and  exposure  is 
needed. 
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4.8  Tenth-micron  MOSFET  Device 
Technology 
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We  have  continued  using  x-ray  lithography  to  fabri¬ 
cate  both  N-  and  P-  channel  MOSFET  devices  with 
effective  channel  lengths  down  to  80  nm.  The 
emphasis  is  on  investigating  hot-carrier  quasi- 
ballistic  transport  processes. 

A  new  generation  of  x-ray  masks  capable  of 
defining  sub-0.1  pm  polysilicon  gates  and  compat¬ 
ible  with  conventional  MOSFET  device  and  circuit 
processing  were  successfully  developed  and  tested. 
A  portion  of  a  ring  oscillator  circuit,  patterned  in  Au 
on  an  x-ray  mask,  is  shown  in  figure  9.  A 
chemically  amplified  resist  (PF-514)  is  used  for  the 
0.1  pm  gate  definition,  as  shown  in  figure  10. 

Self-aligned  P-channel  MOSFET  devices  were  fab¬ 
ricated  on  N-type,  20  Q-cm  (100)  silicon  substrates. 
In  order  to  maximize  channel  hole  mobility  and  to 
control  deep  punchthrough  down  to  Le^  of  0.1  pm, 
super-steep-retrograde  (SSR)  channel  doping  was 
used.  It  consists  of  pf^osphorus  ion  implantation 
with  energy  of  180  KeV  and  dose  of 
5.0  X  10'^  cm- 3,  and  arsenic  ion  implantation  with 
energy  ranging  from  70  KeV  to  94  KeV  and  dose 
ranging  from  4.0  x  1 0'^  cm-  ^  to  1 .0  x  1 0’^  cm-  T  The 
5.2  nm  thick  gate  oxide  was  thermally  grown  at 
800“  C.  In  order  to  form  the  P*-N  source-drain  junc¬ 
tions,  germanium  was  first  implanted  to  amorphize 
the  source-drain  area  and  a  BFj  implant  with 
energy  of  10  KeV  and  dose  of  4.0  x  10’'‘cm-3  was 
carried  out  to  form  the  shallow  P  -  N  junction 
extensions.  A  10  nm  thick  oxide  spacer  was  then 
formed.  A  second  BF2  implant  with  energy  of  20 
KeV  and  dose  of  5.0  x  lO’^cm-^  formed  the  P*  -  N 
junctions  in  the  area  defined  by  the  previously 
formed  oxide  spacer.  The  P*  -  N  junction  annealing 
consisted  of  a  600“C,  30  min  furnace  diffusion  fol¬ 
lowed  by  10  sec  RTA  at  1000“C.  Self-aligned 
cobalt  silicide  with  thickness  of  about  40  nm  was 
formed  to  reduce  the  sheet  resistance  of  thj 
source-drain  junctions.  Typical  values  of  total 
source  and  drain  resistance  were  about  P5G  Qpm. 
Finally,  CV'D  oxide  contact  cuts  and  "  omi  ium  con¬ 
tacts  were  formed  to  the  gates  and  the  source-drain 
regions. 


Two  SSR  channel  doping  profiles  calculated  by 
SUPREM-III  are  shown  in  figure  11.  Two  sets  of 
P-MOSFETs  with  Un  ranging  from  0.1  pm  to  10  pm 
obtained  with  these  two  SSR  doping  profiles  all 
exhibit  excellent  punchthrough  characteristics.  The 
I  -  V  characteristics  of  0.1  pm  P-MOSFET  devices 
at  300  K  and  77  K  is  shown  in  figure  12.  The  peak 
hole  n. ability  at  low  longitudinal  E  -  field  was  92 
cmW-s.  The  room  temperature  threshold  voltage 
was  -0.4  V.  This  shows  that  a  good  compromise 
between  reducing  short  channel  effects  and  main¬ 
taining  practical  device  integrity  is  reached  for  0.1 
pm  scale  P-channel  MOSFET  devices  with  SSR 
channel  doping  profiles. 


X-ray  Mask  for  0.1  pm  Gate  Length 
NMOS  Ring  Oscillator 

(Mask  written  by  e-beam  lithography  at  NRL) 


Figure  9.  X-ray  mask  of  a  0.1  pm  gate-length  NMOS 
ring  oscillator  (x-ray  mask  written  by  e-beam  lithography 
at  the  Naval  Research  Laboratory). 
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Figure  10.  Resist  pattern  for  a  short-channel  MOSFET  exposed  in  the  chemically  amplified  resist  PF514  using  x-ray 
lithography  at  >^  =  1.32  nm. 
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Figure  11.  Dopant  concentration  as  a  function  of  depth 
as  calculated  by  Supreme  III. 


Figure  12.  I-V  characteristics  of  0.1  urn  PMOSFET 
devices  at  77  K  and  300  K. 


4.9  Studies  of  Coulomb  Charging 
Effects  in  Semiconductor 
Nanostructures 
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Many  novel  effects  have  been  observed  in  the 
transport  properties  of  ultrasmall  "quantum-effect" 
transistors  whose  feature  si^e  is  comparable  to  the 
electron  Fermi  wavelength  (about  50  nm  in  a  typical 


inversion  layer).  In  addition  to  the  prominent  role 
played  by  these  effects  in  solid-state  physics,  they 
have  great  potential  in  the  technology  of  the  elec¬ 
tronics  revolution. 

In  a  "quantum  dot"  device,  an  electron  gas  is  con¬ 
fined  electrostatically  in  all  three  spatial  dimensions, 
forming  a  small  "puddle"  of  electrons  bounded  on 
all  sides  by  a  potential  wall.  This  small  electron 
"puddle"  resembles  an  atom  in  that  there  can  only 
be  an  integer  number  of  electrons,  and  these  elec¬ 
trons  can  occupy  only  certain  discrete  energy 
levels. 

Recent  low-temperature  experiments  on  transport 
through  a  narrow  electron  channel  interrupted  by 
two  tunnel  barriers — forming  an  isolated  electron 
"quantum  dot"— have  found  that  the  conductance 
through  the  dot  consists  of  a  series  of  sharp 
maxima  which  occur  periodically  as  the  voltage  on 
the  transistor  gate  is  varied.  This  remarkable  con¬ 
ductance  modulation  arises  from  the  condition  of 
charge  quantization  inside  the  dot.  Each  succes¬ 
sive  conductance  maximum  corresponds  to  the  dis¬ 
crete  addition  of  a  single  electron  to  the  dot. 
Between  successive  conductance  peaks,  the  resist¬ 
ance  increases  by  several  orders  of  magnitude 
because  there  is  a  large  energy  cost  for  an  electron 
in  the  contact  to  enter  the  dot.  This  energy  cost 
can  be  removed  by  changing  the  gate  voltage, 
resulting  in  the  observed  periodic  dependence  of 
the  conductance  on  gate  voltage. 

This  conductance  modulation  through  an  electron 
puddle  has  been  studied  extensively  and  is  now 
well  understood  We  propose  to  study  a  device 
structure  with  two  electron  puddles  which  can 
interact  with  each  other  through  a  thin  tunnel 
barrier.  In  addition  to  exploring  new  physics,  we 
are  motivated  by  the  consideration  that  such  a 
coupled  system  would  represent  the  first  step 
toward  an  architecture  in  which  two  elements  could 
interact  with  each  other  without  an  interconnect. 
Figure  13  shows  a  scanning  electron  micrograph  of 
the  gate  geometry  of  our  device  structure,  fabri¬ 
cated  by  electron  beam  lithography  at  the  National 
Nanofabrication  Facility.  A  negative  voltage  applied 
to  the  gates  depletes  the  high-mobility  two- 
dimensional  electron  gas  (2DEG)  underneath, 
which  is  formed  at  the  GaAs/AIGaAs  interface, 
leaving  behind  two  small  electron  puddles.  Four 
ohmic  contacts  at  the  corners  allow  access  to  the 
2DEG;  both  series  and  parallel  combinations  of  the 
dots  can  be  realized  depending  on  the  bias  volt¬ 
ages. 


5  Cornell  University,  Ithaca,  New  York. 
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Coupled-Coulomb-Blockade  Device 


45  nm 


Figure  13.  Gate  electrodes  fabricated  by  electron  beam 
lithography  and  liftoff  over  a  GaAs/AIGaAs  heterostruc¬ 
ture.  A  negative  voltage  on  the  gate  electrodes  depletes 
the  2DEG  underneath,  leaving  behind  two  electron 
islands.  Ohmic  contacts  at  the  four  corners  allow  access 
to  the  2DEG. 

We  are  also  working  on  studying  Coulomb  charging 
effects  in  devices  fabricated  by  x-ray  lithography. 
Figure  14  shows  a  scanning  electron  micrograph  of 
an  x-ray  mask  which  will  be  used  for  patterning  the 
gate  geometry  of  a  device  with  a  single  electron 
puddle.  The  corresponding  device  will  offer  inde¬ 
pendent  control  of  the  potential  barriers  confining 
the  dot  and  the  number  of  electrons  occupying  the 
dot.  This  independent  control  has  been  found  in 
previous  experiments  to  improve  the  yield  of 
devices  showing  Coulomb  blockade  effects. 


4.10  Study  of  Quasi-One-Dimensional 
Wires  in  GaAs/AIGaAs  Modulation 
Doped  Field-Effect  Transistors 
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Coulomb  Blockade  Device 


100  nm 


Figure  14.  X-ray  mask  to  pattern  gate  geometry  of  a 
quantum  dot  device  with  a  single  electron  puddle.  Note 
that  the  potential  barrier  at  each  constriction  and  the 
number  of  electrons  in  the  dot  can  be  controlled  indepen¬ 
dently. 

A.  Ghanbari,  Professor  Terry  P.  Orlando,  Professor 
Henry  I.  Smith 

We  have  successfully  fabricated  quasi-one-di- 
mensional  (QID)  wires  using  x-ray  lithography  and 
precise  chemical  etching.  These  devices  were  fab¬ 
ricated  on  a  high  quality  GaAs/AIGaAs  modulation- 
doped  substrate.  By  first  patterning  the  wires  using 
x-ray  lithography,  then  chemically  etching  the  GaAs, 
the  region  under  the  etched  areas  become  depleted 
of  electrons,  leaving  behind  many  parallel  quasi- 
one-dimensional  wires.  A  schematic  cross-section 
of  this  device  is  shown  in  figure  15. 

Because  the  operation  of  this  device  depends  crit¬ 
ically  on  the  shape  of  the  confinement  potential, 
control  of  the  etch  depth  is  crucial.  Dry  etching 
techniques  have  sufficient  repeatability  for  this 
application,  but  they  cause  an  unacceptable 
amount  of  damage  to  the  substrate.  Chemical 
etching  minimizes  damage  but  has  a  very  unpre¬ 
dictable  etch  rate. 

To  overcome  these  difficulties,  a  high  precision 
interferometric  technique  was  developed  that  is 
capable  of  resolving  depth  changes  of  less  than  1 
nm.  By  capturing  the  image  formed  in  a  Linnik 
interferometer  and  using  Fourier  analysis  tech¬ 
niques,  we  can  monitor  the  etch  depth  with  a  preci¬ 
sion  <  1  nm,  permitting  the  reliable  fabrication  of 
these  devices. 

When  the  QID  devices  are  measured  at  cryogenic 
temperatures,  the  quantum  confinement  clearly 
manifests  itself  as  a  nonlinear  conductance,  as 
shown  in  figure  16.  Each  nonlinear  bump  in  the 
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transconduclance  (shown  in  the  insert  in  figure  16) 
corresponds  to  the  Fermi  level  passing  through  dif¬ 
ferent  subbands  of  the  Q1 D  confining  potential.  A 
schematic  of  the  confinement  potential,  as  seen  by 
electrons  traveling  down  the  wire,  is  shown  in  figure 
17. 

Work  is  currently  undenway  to  harness  these  Q1D 
wires  in  a  new  type  of  planar  resonant-tunneling 
device  with  Q1D  emitter  and  collector 
(Q1 D-PRESTFET)  that  has  been  predicted  to  show 
very  strong  non-linearities  in  conductance. 


Figure  15.  Schematic  cross-section  of  completed  quasi- 
one-dimensional  device. 


Figure  16.  Measured  conductance  and  transconduc¬ 
tance  for  a  multiple-parallel  quasi-one-dimensional 
device. 


4.11  Planar-Resonant-Tunneling 
Field-Effect  Transistors  (PRESTFET) 

Sponsor 

U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  F49620-92-J-0064 

Project  Staff 

Professor  Dimitri  A.  Antoniadis,  Mike  T.  Chou, 
William  Chu,  Martin  Burkhardt,  Professor  Henry  I. 
Smith 


Figure  17.  Schematic  of  potential  seen  by  electrons 
traveling  along  a  OiD  wire  (arrows  indicate  direction  of 
current  flow). 


Planar-resonant-tunneling  field-effect  transistors 
were  built  on  modulation  doped  GaAs/AIGaAs  sub¬ 
strates  using  x-ray  lithography,  making  it  possible  to 
achieve  good  step  coverage  onto  a  120  nm  high 
mesa,  as  shown  in  figure  18.  This  device,  with  200 
nm  gate  pitch,  showed  good  transistor  character¬ 
istics  but  coherent  resonant  tunneling  between 
source  and  drain  was  not  observed.  Efforts  are 
undenway  to  shrink  the  gate  pitch  to  150  and  100 
nm.  We  expect  to  see  resonant  tunneling  in  these 
devices  at  liquid  Helium  temperature. 

We  are  also  investigating  the  feasibility  of 
PRESTFETs  in  Si.  Top  and  sideviews  of  the 
device  are  shown  in  figure  19.  This  device  has  a 
two-level  gate  structure.  A  very  thin,  high-quality 
layer  of  silicon  dioxide  (-lOnm)  is  grown  on  a 
p-type  silicon  substrate.  Then,  two  parallel  tun¬ 
neling  electrodes  (lower  level  gates)  will  be  pat¬ 
terned  in  polysilicon  using  x-ray  lithography  and 
liftoff.  A  thin  layer  (-lOnm)  of  insulating  silicon 
nitride  will  be  deposited  over  this  structure.  Finally, 
a  large  metal  gate  (the  upper  level  gate)  will  be 
deposited  over  the  silicon  nitride.  A  two- 
dimensional  electron  gas  (2-DEG)  will  be  formed  at 
the  Si/SiOz  interface  by  applying  a  positive  bias  to 
the  upper  gate.  Tunnel  barriers  in  the  2-DEG  will 
be  created  directly  underneath  the  lower  gates  by 
applying  appropriate  bias  voltages. 

This  silicon  PRESTFET  offers  the  exciting  possi¬ 
bility  of  potential  modulations  over  extremely  small 
distances  (on  the  order  of  10  nm).  For  the  equiv¬ 
alent  GaAs  device,  the  distance  between  the  gate 
electrodes  and  the  2-DEG  is  limited  to  the  minimum 
thickness  of  the  n-doped  AIGaAs  layer  that  is 
required  to  achieve  a  certain  electron  concentration 
at  the  2-DEG  (i.e.,  about  50  nm).  The  main  disad- 
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vantage  of  the  silicon  system  is  its  low  mobility 
compared  to  GaAs.  But,  if  we  optimize  the  Si/SiOi 
interface  quality  by  avoiding  contamination  and  con¬ 
trolling  the  damage  caused  by  processing  steps,  it 
is  possible  to  obtain  an  interface  defect/impurity 
concentration  as  low  as  1  x  10'°  cm- This  implies 
an  average  spacing  between  defects  of  about  100 
nm.  Since  our  goal  for  the  silicon  PRESTFET  is  to 
fabricate  a  50  nm-wide  quantum  well  sandwiched 
between  two  50  nm-wide  barriers,  carrier  mobility, 
and  hence  scattering,  should  not  be  a  fundamental 
limitation.  We  expect  to  observe  quantum  transport 
effects  at  4.2  K. 


Figure  18.  Electron  micrograph  of  PRESTFET  on  GaAs 
fabricated  using  x-ray  lithography,  which  enables  good 
step  coverage  up  onto  the  120  nm-high  mesa. 


Figure  19.  Schematic  of  Si  PRESTFET. 
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4.12  Dual  Electron  Waveguide  Device 
Fabricated  using  X-ray  Lithography 

Sponsor 

National  Science  Foundation 
Grant  DMR  87-19217 
Grant  DMR  90-22933 

Project  Staff 

Professor  Jesus  A.  del  Alamo,  William  Chu, 
Cristopher  C.  Eugster,  Professor  Henry  I.  Smith. 

We  have  recently  fabricated  a  dual-electron- 
waveguide  device,  shown  in  figure  20.  using  x-ray 
nanolithography.  An  electron  waveguide  is 
essentially  a  one-dimensional  channel  which  elec¬ 
trons  can  travel  through  without  scattering.  In  our 
device,  two  such  channels  are  electrostatically 
formed  in  close  proximity  of  each  other  by  depleting 
those  electrons  in  an  AIGaAs/GaAs  modulation- 
doped  heterostructure  which  reside  underneath  the 
three  gates  shown  in  figure  20.  The  x-ray  mask 
used  in  the  fabrication  process  was  patterned  using 
50  keV  e-beam  lithography  at  the  Naval  Research 
Laboratory.  We  use  x-ray  lithography  to  replicate 
this  mask  to  change  its  polarity.  Then  we  align  the 
new  mask  to  the  AIGaAs/GaAs  sample  and  expose 
it  using  contact  x-ray  nanolithography. 

In  an  electron  waveguide,  similar  to  an  optical 
waveguide,  discrete  transverse  modes  arise  due  to 
the  lateral  confinement.  The  conductance  of  each 
waveguide  mode  is  equal  to  a  fundamental  con¬ 
stant  2  e^/h.  This  results  from  the  cancellation  of  the 
energy  dependence  in  the  product  of  the  ID 
density  of  states  and  the  electron  velocity.  In 
recent  experiments  on  x-ray  fabricated  dual  electron 
waveguide  devices,  quantized  conductance  for 
each  waveguide  was  observed  as  the  number  of 
occupied  modes  was  electrostatically  varied. 

4.13  Far-Infrared  Spectroscopy  of 
Arrays  of  Quantum  Dots  and  Wires 

Sponsors 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  F49620-92-J-0064 


Coupled  Electron 
Waveguides 


Middle  gate;  60  nm  linewidth 


Figure  20.  Electron  micrograph  of  dual-electron- 
waveguide  device  fabricated  using  x-ray  nanolithography. 

Project  Staff 

Professor  Dimitri  A.  Antoniadis,  Martin  Burkhardt, 
Reza  A.  Ghanbari,  Professor  Terry  P.  Orlando,  Pro¬ 
fessor  Henry  I.  Smith,  Professor  M.  Shayegan,® 
Song  Sang-hun,®  Professor  Daniel  Tsui,®  Kenneth 
Yee,  Yang  Zhao 

In  collaboration  with  researchers  at  Princeton  Uni¬ 
versity,  we  have  used  x-ray  lithography  to  fabricate 
devices  for  magneto-optic  measurements  of  a  two- 
dimensional  electron  gas  (2DEG).  Metal  grids  with 
spatial  periods  of  300-600  nm  in  both  orthogonal 
directions  and  covering  several  square  millimeters 
were  placed  on  the  surface  of  a  modulation-doped 
AlGaAs  substrate.  The  grids  were  used  to  period¬ 
ically  modulate  far-infrared  radiation  (FIR)  incident 
on  the  2  DEG.  Due  to  the  close  proximity  of  the 
grids  to  the  heterointerface  (-  50  nm),  it  was  pos¬ 
sible  to  access  an  electron-electron  interaction 
effect  by  the  FIR  radiation  at  the  wave  vectors  of 
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the  metal  grid.  These  spectra  were  observed  as  a 
function  of  magnetic  field.  Results  were  in  agree¬ 
ment  with  theoretical  models. 

Using  similar  lithographic  techniques,  metal  gratings 
with  spatial  periods  of  300-600  nm  were  placed  on 
modulation-doped  substrates  (figure  21a),  and  a 
gate  electrode  was  attached  to  the  grating. 
Depending  on  the  applied  gate  voltage,  the  poten¬ 
tial  seen  by  the  electrons  at  the  2DEG  can  be 
varied  from  uniform  (in  which  case  the  electrons 
behave  as  a  normal  2  DEG)  to  weakly-coupled 
quantum  wires  (figure  21b),  or  to  isolated  quantum 
wires  (figure  21c).  These  quantum-wire  structures 
were  measured  using  FIR  cyclotron  resonance. 
Mode  softening  due  to  quantum-mechanical  cou¬ 
pling  between  wires  was  observed.  The  results 
were  in  agreement  with  theoretical  models  of 
coupled-quantum  wires. 

Arrays  of  etched  quantum  dots,  quantum  wires,  and 
quantum  anti-dots  have  also  been  fabricated,  with 
typical  spatial  periods  of  300-600  nm.  The  elec¬ 
tronic  subbands  of  these  structures  were  also 
examined  using  FIR  cyclotron  resonance.  Cur¬ 
rently,  we  are  continuing  our  study  using  extremely 
high-quality  samples  prepared  by  Professor 
Shayegan's  group  at  Princeton. 


Figure  21 .  (a)  Metal  grating  gate  on  a  modulation-doped 
AIGaAs/GaAs  substrate;  (b)  depiction  of  potential  seen  by 
electrons  at  the  AIGaAs/GaAs  interface  for  weakly 
coupled  quantum  wires;  (c)  potential  for  the  case  of  iso¬ 
lated  quantum  wires. 


4.14  Ridge-Grating 

Distributed-Feedback  Lasers  Fabricated 
by  X-Ray  Lithography 

Sponsor 

U.S.  Army  Research  Office 
Grant  DAAL03-92-G-0291 

Project  Staff 

James  M.  Carter,  Woo-Young  Choi,  Juan  Ferrera, 
Professor  Clifton  G.  Fonstad,  Jr.,  Professor  Henry  I. 
Smith,  Vincent  V.  Wong 

Distributed-feedback  lasers  (DFB)  are  essential 
components  in  optical  communications  systems 
because  they  operate  in  a  single-longitudinal-mode 
and  are  easily  integrable  with  electronic  drive  circu¬ 
itry.  In  the  fabrication  of  a  typical  DFB  laser,  an 
epitaxial  regrowth  step  is  carried  out  on  top  of  the 
grating,  which  can  be  yield-limiting.  In  particular, 
because  of  etch  back,  the  coupling  constant  after 
epi-growth  can  be  difficult  to  predict.  In  the  fol¬ 
lowing  paragraphs  we  describe  a  novel  DFB  laser 
that  eliminates  the  need  for  regrowth  and  maxi¬ 
mizes  the  utility  of  ridge-waveguide  devices  which 
are  much  simpler  to  fabricate. 

To  realize  such  a  device  structure,  we  have  devel¬ 
oped  a  fabrication  process  that  utilizes  x-ray  lithog¬ 
raphy  for  the  grating  fabrication  because  it  offers 
good  process  latitude,  high  throughput,  long-range 
spatial-phase  fidelity  in  the  gratings,  and  large 
depth-of-focus,  which  is  essential  for  sidewall  pat¬ 
terning.  A  schematic  of  the  ridge-grating  DFB  laser 
is  shown  in  figure  22.  After  the  epitaxial  growth  of 
a  complete  InGaAIAs  graded-index,  separate-con¬ 
finement,  multi-quantum-well-layer  diode  structure 
by  molecular-beam  epitaxy  on  an  InP  substrate,  a 
3-4  nm-wide  ridge  waveguide  is  formed  by  wet¬ 
etching.  The  wet-etching  conditions  are  carefully 
controlled  so  that  the  desired  side-wall  profile  is 
obtained.  A  first-order  grating  with  a  period  of  230 
nm  for  the  target  lasing  wavelength  of  1.55  urn  is 
then  patterned  in  resist  using  x-ray  lithography. 
The  Cut-line  {K  =  1 .32  nm)  of  an  electron- 
bombardment  source  is  used  in  conjunction  with 
SiN,  mask  technology.  The  x-ray  mask  is  patterned 
using  holographic  lithography,  which  guarantees 
excellent  uniformity  and  long-range  spatial-phase 
coherence  in  the  grating,  followed  by  gold 
electroplating.  Dry-etching  is  used  to  transfer  the 
grating  into  both  sides  of  the  ridge  waveguide. 
With  this  ridge-grating,  the  optical  feedback  occurs 
via  the  interaction  of  the  lateral  fields  with  the  index 
modulation  introduced  by  the  grating.  Holographic 
lithography  could  not  be  used  directly  for  the  grating 
exposure  because  of  troublesome  coherent  reflec- 
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tions  from  the  ridge  topography.  For  advanced 
optical  communication  systems  (e.g.,  those  imple¬ 
menting  wavelength-division  multiplexing),  quarter- 
wave-shifted  DFB  lasers,  hundreds  of  microns  in 
length,  are  necessary.  This  will  require  e-beam 
lithography  that  is  free  of  distortion  and  stitching 
errors  for  the  x-ray  mask  making.  As  described  in 
section  4.4,  we  are  pursuing  a  technique  based  on 
a  global-fiducial  grid  scheme  which  can  potentially 
solve  both  of  these  problems.  Once  developed, 
this  technique  will  be  incorporated  into  the  e-beam 
lithography  process  to  define  the  appropriate  DFB 
laser  grating  patterns  onto  a  SiN,  x-ray  mask. 


Figure  22.  Schematic  of  ridge-grating  DFB  laser. 


4.15  Channel-Dropping  Filters 
Fabricated  by  X-Ray  Lithography 

Sponsor 

U.S.  Army  Research  Office 
Grant  DAAL0-92-G-0291 

Project  Staff 

Jay  N.  Damask,  Professor  Hermann  A.  Haus,  Pro¬ 
fessor  Leslie  A.  Kolodziejski,  Professor  Henry  I. 
Smith,  Vincent  V.  Wong 

Channel-dropping  filters  are  novel  optoelectronic 
devices  that  can  extract  a  signal  of  a  particular 
wavelength  from  an  optical  bus  carrying  multiple- 
signals.  The  extraction  is  achieved  through  the 
interaction  of  three  coupled  waveguides.  A  sche¬ 
matic  of  a  channel-dropping  filter  (CDF)  is  shown  in 
figure  23.  The  middle  waveguide,  which  carries 
several  signals,  is  side-coupled  to  two  quarter- 


wave-shifted  grating  resonators.  These  grating 
resonators  are  resonant  at  frequency  fj  and  serve 
to  extract  this  particular  frequency  component  from 
the  center  waveguide. 

For  our  initial  demonstration  of  the  CDF  concept, 
we  are  working  with  a  Si02/Si3N4-layered  system. 
For  this  materials  system,  grating  periods  tailored 
for  a  center  optical  wavelength  of  1.55  pm  will  be 
approximately  515  nm.  X-ray  lithography  will  be 
used  to  transfer  the  grating  pattern  onto  ridge 
waveguides  fabricated  by  wet-chemical  etching. 
The  patterning  of  the  necessary  x-ray  mask  poses 
a  significant  challenge  to  existing  mask  patterning 
techniques.  In  particular,  the  CDF  requires 
spatially-coherent  gratings  spanning  hundreds  of 
microns,  with  precise  control  of  pitch  to  - 10  ®  and 
appropriately  placed  phase  steps  of  (71/2)  ±10-^ 
radians.  As  described  in  section  4.4,  we  are  devel¬ 
oping  a  spatial-phase-locked  electron-beam  lithog¬ 
raphy  technique  to  achieve  the  required  spatial 
fidelity  in  the  x-ray  masks. 

4.16  Novel  Superconducting  Tunneling 
Structures 

Sponsor 

Defense  Advanced  Research  Projects  Agency 
Consortium  for  Superconducting  Electronics 

Project  Staff 

Dr.  Jack  Chu,^  Professor  John  M.  Graybeal,  Dr. 
Bernard  S.  Meyerson,^  George  E.  Rittenhouse,  Pro¬ 
fessor  Henry  I.  Smith 

In  this  project,  we  are  studying  the  behavior  of 
short-channel  hybrid  superconductor-degenerate 
semiconductor-superconductor  (SSmS)  Josephson 
devices,  whose  geometry  is  designed  to  display 
quantum  electronic  interference  behavior.  These 
studies  include  both  theoretical  and  experimental 
examinations.  This  work  represents  the  first  exam¬ 
ination  of  quantum  interference  effects  for  Cooper 
pairs  via  tunneling  through  extended  resonant 
states.  The  characteristic  energy  scale  for  such  a 
device  would  be  set  by  the  quantum  confinement, 
and  consequently  may  provide  a  new  avenue  for 
obtaining  finite  gains  in  Josephson  three-terminal 
devices. 

By  numerically  solving  the  Bogoliubov-deGennes 
equations  for  an  SSmS  junction,  we  have  theore¬ 
tically  shown  that  the  interference  of  ballistic 
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Figure  23.  Structures  of  channel-dropping  filter  experiments  on  Si0j/Si3N4/Si02. 


Cooper  pairs  through  spatially  extended  resonant 
states  is  indeed  possible.  As  a  function  of  the 
Fermi  energy  in  the  degenerate  semi-conductor 
(i.e.,  as  a  function  of  electrostatic  gate  bias),  we 
obtain  striking  cusp-shaped  resonances  in  the 
Josephson  critical  current  corresponding  to  the 
Fabry-Perot  interference  criteria.  We  find  that  such 
interference  results  regardless  of  how  weakly 
coupled  the  quasi-bound  states  within  the  semicon¬ 
ductor  layer  are  to  the  superconducting  contacts. 
The  coupling  strength  determines  only  the  width 
and  height  of  the  cusp.  Our  calculations  were  per¬ 
formed  assuming  a  one-dimensional  system  with  a 
high-mobility  semiconductor  electron  gas.  The 
origin  and  nature  of  these  resonances  have  been 
carefully  analyzed,  and  differ  markedly  from  those 
found  in  conventional  non-superconducting  reso¬ 
nant  tunneling  structures. 

Note  that  our  numerical  solution  to  the 
Bogoliubov-deGennes  equations  can  treat  an  arbi¬ 
trary  potential  function  within  the  semiconductor. 
We  can  thus  consider  a  wide  range  of  interesting 
and  important  junction  geometries  including  con¬ 


duction  band  mismatches,  insulating  interface  or 
Schottky  barriers,  and  intentionally  introduced 
potentials  within  the  N-layer  which  can  enhance  the 
quantum  interferences.  A  key  advantage  of  our 
technique  is  that  it  yields  the  full  quasiparticle  exci¬ 
tation  spectrum  within  the  Sm-layer  as  a  function  of 
the  phase  difference  across  the  junction,  thus  pro¬ 
viding  important  insight  into  the  resultant  physics. 

We  are  exploring  the  fabrication  of  such  a  resonant 
tunneling  Josephson  junction  using  a  high-mobility 
Si:Sii-xGe,  two-dimensional  electron  gas  (2DEG). 
A  crucial  and  very  demanding  requirement  of  such 
a  resonant  SSmS  structure  is  that  the 
superconductor-semiconductor  interface  be  clean 
and  comparatively  defect-free.  The  preferred  elec¬ 
tronic  transport  channel  through  the  S-Sm  inter¬ 
faces  is  direct  tunneling,  as  inelastic  processes 
and/or  tunneling  through  atomic  localized  states  can 
be  highly  disruptive  to  the  Cooper  pair  coherence. 
Presently,  we  are  exploring  superconductor-2DEG 
contacts  using  an  intermediate  phosphorous- 
implanted  Si  region.  Optimizations  of  this  contact 
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scheme  are  underway  to  improve  their  transmission 
of  Cooper  pairs. 

4.17  Submicrometer-Period 
Transmission  Gratings  for  X-ray  and 
Atom-Beam  Spectroscopy  and 
Interferometry 

Sponsor 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 

Project  Staff 

James  M.  Carter,  Julie  C.  Lew,  Dr.  Mark  L. 
Schattenburg,  Professor  Henry  I.  Smith 

Transmission  gratings  with  periods  of  100-1000  nm 
are  finding  increasing  utility  in  applications  such  as 
x-ray,  vacuum-ultraviolet,  and  atom-beam  spectros¬ 
copy  and  interferometry.  Over  20  laboratories 
around  the  world  depend  on  MIT-supplied  gratings 
in  their  work.  For  x-ray  and  VUV  spectroscopy, 
gratings  are  made  of  gold  and  have  periods  of 
100-1000  nm  and  thicknesses  ranging  from 
100-1000  nm.  They  are  most  commonly  used  for 
spectroscopy  of  the  x-ray  emission  from  high- 
temperature  plasmas.  Transmission  gratings  are 
supported  on  thin  (1  pim)  polyimide  membranes  or 
made  self  supporting  ("free  standing")  by  the  addi¬ 
tion  of  crossing  struts  (mesh).  (For  short  x-ray 
wavelengths  membrane  support  is  desired,  while  for 
the  long  wavelengths  a  mesh  support  is  preferred  in 
order  to  increase  efficiency.)  Fabrication  is  per¬ 
formed  by  holographic  lithography,  x-ray  lithography 
and  electroplating.  Progress  in  this  area  focuses 
on  improving  the  yield  and  flexibility  of  the  fabri¬ 
cation  procedures. 

Another  application  is  the  diffraction  of  neutral 
sodium  beams  (de  Broglie-wavelength  -  17  pm)  by 
mesh-supported  gratings.  Professor  David 
Pritchard's  group  at  MIT  has  clearly  demonstrated 
atomic  diffraction  and  interference.  Because  good 
spatial  coherence  (low  distortion)  of  the  grating  is 
critical  to  ensure  measurable  Interference  of  the 
beams,  efforts  are  concentrated  on  the  use  of  holo¬ 
graphic  lithography  and  the  reactive-ion  etching  of 
free-standing  gratings  in  low  stress  and  high 
stiffness  materials  such  as  silicon  nitride.  This 
process  is  continuously  being  improved  via  an 
undergraduate  thesis  project.  We  also  will  use 
holographic  interferometry  (figure  5,  section  4.3)  to 


directly  measure  the  spatial-phase  fidelity  of  the 
free-standing  nitride  gratings. 


4.18  High-Dispersion,  High-Efficiency 
Transmission  Gratings  for 
Astrophysicai  X-ray  Spectroscopy 

Sponsor 

National  Aeronautics  and  Space  Administration 
Contract  NAS8-36748 

Project  Staff 

Professor  Claude  R.  Canizares,  Robert  C.  Fleming, 
Jr.,  Dr.  Mark  L.  Schattenburg,  Professor  Henry  I. 
Smith 

This  work  involves  a  collaboration  between  the 
Center  for  Space  Research  and  the  Submicron 
Structures  Laboratory  (SSL),  providing  transmission 
gratings  for  the  Advanced  X-ray  Astrophysics 
Facility  (AXAF)  x-ray  telescope,  currently  scheduled 
for  launch  in  1998.  Many  hundreds  of  low- 
distortion,  large-area  transmission  gratings  of  200 
nm  period  (gold)  and  460  nm  period  (silver)  are 
required.  These  will  provide  high  resolution  x-ray 
spectroscopy  of  astrophysicai  sources  in  the  100 
eV  to  10  keV  band. 

Because  of  the  requirements  of  low  distortion,  high 
yield,  and  manufacturability,  a  fabrication  procedure 
involving  the  replication  of  x-ray  masks  has  been 
selected.  Masks  are  made  of  high-stiffness  silicon 
nitride  membranes  to  eliminate  distortion  and  pat¬ 
terned  using  a  process  involving  holographic  lithog¬ 
raphy,  reactive-ion  etching,  and  electroplating.  The 
masks  are  then  replicated  using  soft  x-rays  (1  -  1.5 
nm)  and  the  resulting  patterns  electroplated  with 
gold  or  silver.  An  etching  step  then  yields 
membrane-supported  gratings  suitable  for  space 
use.  Flight  prototype  gratings  have  been  fabricated 
and  continue  to  undergo  space-worthiness  tests. 
Progress  in  this  area  focuses  on  increasing  the 
yield  and  flexibility  of  the  fabrication  procedures, 
and  the  perfection  of  various  mask  and  grating 
evaluation  tests. 


4.19  Submicron-Thickness  X-ray 
Window  Technology 
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National  Aeronautics  and  Space  Administration 
Grant  NAGW-2003 
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Schattenburg,  Professor  Henry  I.  Smith 

We  have  been  investigating  various  schemes  for 
fabricating  leak-free,  large-area,  ultrathin  mem¬ 
branes  to  serve  as  vacuum  isolation  windows  for 
the  transmission  of  x-rays.  Applications  include 
gas-filled  x-ray  detector  windows  and  high-vacuum 
isolation  windows  for  x-ray  lithography.  Current 
window  technology  uses  relatively  thick  beryllium 
windows  which  are  opaque  to  x  rays  with  wave¬ 
lengths  in  the  0.5  -  1.5  nm  band.  However,  this 
band  is  very  useful  for  x-ray  detector  and  x-ray 
lithography  applications.  Current  efforts  seek  to 
perfect  large-area  polyimide  windows  which  have  a 
1.0  micron  thickness.  When  combined  with 
tungsten  or  nickel  meshes,  these  have  been  made 
leak  free  to  the  limit  of  He  leak  detector  technology. 
Woven  tungsten  support  meshes  are  used,  and 
also  advanced  nickel  meshes  made  by  deep-etch 
x-ray  lithography  (so-called  LIGA  process),  in  col¬ 
laboration  with  the  MicroParts  company  in 
Germany.  Future  efforts  will  seek  to  reduce  the 
membrane  thickness  still  further,  and  also  exper¬ 
iment  with  silicon  nitride  membranes  which  promise 
to  be  leak  free  and  also  bakeable  for  high  vacuum 
applications.  A  silicon  nitride  isolation  window  is 
being  used  as  a  vacuum  isolation  window  in  our 
laboratory  x-ray  aligner.  New  high-strength  nitrides, 
which  we  have  developed  using  our  dedicated  in- 
house  LPCVD  system,  promise  to  further  improve 
window  transmission. 


4.20  GaAs  Epitaxy  on 
Sawtooth-Patterned  Silicon 
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The  growth  of  GaAs  on  Si  offers  the  possibility  of 
combining  high-speed  and  optoelectronic  GaAs 
devices  with  Si  integrated-circuit  technology.  Ori¬ 
ented  gratings  of  200  nm  period  are  fabricated  in 
SisNA  on  (100)  Si  substrates.  Anistrotropic  etching 
in  KOH  is  then  used  to  produce  sawtooth-profile 
gratings  in  the  Si.  Then  these  serve  as  substrates 


for  GaAs  growth  by  MOCVD.  The  dislocation 
density  in  the  grown  GaAs  films  is  orders  of  magni¬ 
tude  lower  than  the  density  in  films  formed  on 
planar  Si  substrates. 
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Additive  Pulse  Modelocking  (APM)  is  a  technique  of 
modelocking  that  simulates  saturable  absorber 
action  by  transforming  nonlinear  phase  modulation 
due  to  the  Kerr  effect  into  amplitude  modulation. 
Since  the  Kerr  effect  has  a  very  short  response 
time,  very  fast  absorption  characteristics  can  be 
achieved.  The  Kerr  effect  is  also  responsible  for 


self-focusing,  an  effect  used  in  the  Kerr  Lens 
Modelocking  (KLM). 

In  order  to  compare  the  two  modelocking  schemes, 
we  have  developed  an  analytic  theory  of  KLM.’ 
With  the  analytic  theory  of  APM  already  available,  it 
was  possible  to  make  comparisons  on  the  merits  of 
the  two  schemes.’ 

It  has  been  observed  by  several  researchers  that 
achievable  pulsewidths  are  critically  dependent 
upon  third  order  dispersion  of  the  system  (the 
dependence  of  the  group  velocity  dispersion  upon 
frequency).  We  have  gained  a  good  understanding 
of  the  process  by  extensive  computer  simulations 
and  an  analytic  treatment  that  is  capable  of  repro¬ 
ducing  the  main  features  of  the  simulation  results.^ 
Briefly  stated,  third  order  dispersion  generates 
sidebands  of  the  pulse  spectrum  because  the  dis¬ 
persion  characteristic  provides  phase  matching  of 
the  main  pulse  phase  velocity  with  the  phase 
velocity  of  a  spectral  sideband  of  the  pulse.  This 
effect  drains  energy  from  the  main  pulse  and 


’  H.A.  Haus.  J.G.  Fujimoto,  and  E.P.  Ippen,  "Analytic  Theory  of  Additive  Pulse  and  Kerr  Lens  Mode  Locking,"  IEEE  J.  Quant.  Elec¬ 
tron.  28;  2086-2096  (1992). 

2  H.A.  Haus,  J.D.  Moores,  and  L.E.  Nelson,  "Effect  of  Third-order  Dispersion  on  Passive  Mode  Locking,"  Opt.  Lett.  18:  51-53  (1993). 
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reduces  the  saturable  absorber  action.  A  surprising 
outcome  of  the  analysis  is  that,  when  third  order 
dispersion  effects  are  significant,  the  pulse  can  be 
shortened  by  filtering  its  spectrum.  This  filtering 
reduces  the  sideband  generation  and  thus  reduces 
its  damaging  effect. 

Another  theoretical  investigation  was  triggered  by 
observations  of  C.  Doerr,  a  graduate  student 
attempting  to  actively  modelock  an  erbium  fiber  ring 
laser  to  produce  high  repetition  rate  pulse  trains.  It 
was  hoped  that  the  relatively  short  solitons  (1  ps  or 
less)  in  the  laser  with  negative  group  velocity  dis¬ 
persion  could  be  forced  to  circulate  at  a  Gbit  rate. 
Mr.  Doerr  found  that  he  could  achieve  good  steady 
state  pulse  generation  with  100  ps  pulses,  but  not 
shorter  ones.  The  computer  simulations  made  by 
L.  Nelson,  aided  by  analytic  work,  showed  that  this 
is  consistent  with  theoretical  predictions.  The 
modelocked  pulses  can  be  shortened  somewhat  by 
the  Kerr  effect  in  the  fiber,  as  already  pointed  out  in 
our  previous  work.^  However,  when  much  shorter 
pulses  are  attempted  with  increased  Kerr  effect,  the 
system  is  forced  to  produce  pulses  closely  resem¬ 
bling  higher  order  solitons.  These  pulses,  of 
course,  change  continuously  and  cannot  lead  to  a 
steady  state  pulse  train  of  constant  amplitude.  This 
work  also  provides  a  good  physical  interpretation 
for  Haus  and  Silberberg,^  which  did  not  provide  a 
physical  explanation  of  the  results  of  its  computer 
simulations.  It  is  clear,  however,  that  both  very 
short  pulses  and  pulse  timing  via  active 
modelocking  are  achievable  when  APM  compo¬ 
nents  are  introduced  into  the  laser  system.  We 
shall  pursue  this  problem  both  theoretically  and 
experimentally. 
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Work  on  all-fiber  modelocked  lasers  is  being 
pursued  at  several  laboratories,  e.g.,  the  Naval 
Research  Laboratory  and  the  University  of 
Southampton.  We  have  concentrated  on  one  par¬ 
ticular  realization,  a  simple  unidirectional  additively 
pulse  modelocked  (APM)  ring  erbium  fiber  laser. 
The  gain  fiber  also  acts  as  the  nonlinear  Kerr 
element.  Fiber  polarization  transformers  and  fiber 
polarizer  and  isolator  make  this  an  all-fiber  system. 

K.  Tamura  was  successful  in  obtaining  stable  oper¬ 
ation  of  the  laser^  with  pulses  at  42  MHz  repetiton 
rate  (one  pulse  in  the  ring)  of  452  fs  duration.  The 
system  is  truly  self-starting  because  it  breaks  into 
pulse  operation  spontaneously  when  a  36  mw 
pump  threshold  is  reached  (cw  operation  is  initiated 
at  1 1  mw  pump  power).  The  self-starting  behavior 
is  in  agreement  with  our  previously  published 
theory®  that  showed  self-starting  impeded  by  small 
backscatter  in  the  laser  resonator.  The  scatter  dis¬ 
places  the  resonator  mode  spectrum  and  thereby 
calls  for  increased  injection  signals  into  the  indi¬ 
vidual  modes.  In  the  unidirectional  ring  only  a  com¬ 
bination  of  two  backscatters  (one  backward,  the 
other  forward)  displaces  the  resonator  mode  spec¬ 
trum,  and  hence  the  effect  is  greatly  reduced. 

The  very  successful  operation  of  this  laser  has 
encouraged  us  to  intensify  the  work  on  this  partic¬ 
ular  system.  We  will  implement  schemes  to  obtain 
stable  operation  with  more  than  one  pulse  within 
the  resonator,  and  we  will  work  toward  generation 
of  even  shorter  pulses. 

1.3  Long  Distance  Fiber 
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3  H.A.  Haus  and  Y.  Silberberg,  "Laser  Mode  Locking  with  Addition  of  Nonlinear  Index,"  IEEE  J.  Quant.  Electron.  QE-22'.  325-331 
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Last  year  we  showed  that  the  Gordon-Haus  effect 
that  sets  a  limit  on  long  distance  soliton  communi¬ 
cations  could  be  overcome  partially  by  filtering  the 
pulse  stream  at  every  amplifier  stage.®  The  carrier 
frequency  of  a  soliton  that  deviates  from  the  filter 
center  frequency  is  pushed  by  the  filter  toward  the 
filter  center  frequency.  Since  the  Gordon-Haus 
effect  is  a  noise-induced  random  walk  of  soliton 
carrier  frequency,  filters  help  to  reduce  this  effect. 
The  filters  also  help  in  reducing  background  noise. 
Indeed,  the  net  linear  bandwidth  of  the  transmission 
link  is  greatly  reduced  by  the  cascade  of  the  filters 
so  that  linear  high-bitrate  communications  are 
impossible.  Fiber  nonlinearity  allows  solitons  to 
recover  their  spectral  width  when  it  has  been 
reduced  by  filtering:  thus  high  bit-rate  soliton  pulse 
trains  can  be  transmitted.  The  same  proposal  was 
made  independently  by  Hasegawa  and  Kodama  at 
Bell  Laboratories.^  The  scheme  permits  wavelength 
multiplexing  if  the  filters  are  the  Fabry-Perot  type 
with  one  multiple  transmission  peak  per  channel. 

Nakazawa  of  NTT  demonstrated  a  pulse  train  trans¬ 
mission  that  also  extended  the  Gordon-Haus  limit, 
using  both  filters  and  modulators.®  This  scheme  is 
not  readily  adaptable  to  wavelength  multiplexed 
transmission.  However,  we  have  now  shown  that  it 
eliminates  the  Gordon-Haus  limit  entirely.®  A  pulse 
stream  of  ones  and  zeros  can  propagate  indefinitely 
in  a  fiber  ring  with  gain  to  compensate  for  the  loss 
and  with  filters  and  modulators  synchronized  to  the 
pulse  stream.  The  zeros  are  maintained  because 
the  modulators  spread  the  spectrum  of  the  noise 
accumulating  in  the  time  slots  occupied  by  zeros; 
then  the  filters  absorb  it.  A  fiber  ring  that  operates 
on  this  principle  can  be  used  as  a  high  bit-rate  all- 
optical  memory.  We  are  now  investigating  how  this 
memory  can  be  used  in  conjunction  with  all-optical 
switches,  delay  lines,  and  frequency  shifters  for  the 
conversion  of  Time  Division  Multiplexing  (TDM)  to 
Wavelength  Division  Multiplexing  (WDM)  and  back 
again. 


1.4  Squeezing 
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We  are  continuing  our  experiments  on  squeezing 
with  optical  fibers.  Comparison  of  our  results  with 
those  of  the  IBM  group  have  shown  that  the  degree 
of  squeezing  with  pulses  of  100  ps  duration  is  crit¬ 
ically  dependent  on  the  Guided  Acoustic  Wave 
Brillouin  Scattering  (GAWBS)  in  the  fiber.  The  fiber 
used  in  our  previous  experiments  had  a  "window"  of 
low  GAWBS  within  the  frequency  range  in  which  we 
observed  squeezing  (40-80  kHz).  We  did  not 
obtain  squeezing  with  a  fiber  given  to  us  by  the  IBM 
group  which  showed  much  higher  GAWBS  in  the 
same  frequency  regime.  This  finding  prompted  us 
to  look  for  ways  to  eliminate  GAWBS.  The  IBM 
group  has  already  proposed  and  demonstrated 
squeezing  with  reduced  GAWBS  using 
subpicosecond  pulses;  the  shorter  the  pulses,  the 
less  GAWBS  is  excited.  Another  advantage  pre¬ 
sented  by  shorter  pulses  of  a  given  energy  is  that 
squeezing  can  be  achieved  in  short  fibers  (<  1  m), 
further  reducing  GAWBS.  C.  Doerr  has  reproduced 
these  results  in  our  laboratory.  There  are  two  other 
ways  of  reducing  GAWBS: 

1.  A  rapid  phase  modulation  of  the  local  oscillator 
(  >  1  GHz)  can  cancel  out  the  GAWBS  upon 
detection.’® 


6  A.  Mecozzi,  J.D.  Moores,  H.A.  Haus,  and  Y.  Lai,  "Soliton  Transmission  Control,"  Opt.  Lett.  16: 1841-1843  (1991). 

r  Y.  Kodama  and  A.  Hasegawa,  "Generation  of  Asymptotically  Stable  Optical  Solitons  and  Suppression  of  the  Gordon-Haus  Effect," 
Opt.  Lett.  17:  31-33  {1992V 

8  M.  Nakazawa,  E.  Yamada,  H.  Kubota,  and  K.  Suzuki,  "10  Gbit/s  Soliton  Transmission  over  One  Million  Kilometres,"  Electron.  Lett. 
27;  1270-1272  (1991). 

9  H.A.  Haus  and  A.  Mecozzi,  "Long-term  Storage  of  a  Bit  Stream  of  Solitons,"  Opt.  Lett.  17:  1500-1502  (1992). 

'0  M.  Shirasaki  and  H.A.  Haus,  "Reduction  of  Guided-acoustic-wave  Brillouin  Scattering  Noise  in  a  Squeezer,"  Opt.  Lett.  17:  1225-1227 
(1992). 
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2.  A  rapid  repetition  rate  of  the  pulses  (1  GHz) 
puts  the  GAWBS  spectrum  into  a  frequency 
regime  that  does  not  convolve  to  low  frequen¬ 
cies  upon  detection. 

We  have  demonstrated  successfully  scheme  1,” 
which  was  employed  in  conjunction  with  a  phase 
measurement  in  a  Mach-Zehnder  interferometer 
achieving  a  noise  floor  3  dB  below  the  shot  noise 
level.  This  is  the  first  demonstration  of  the  use  of 
Kerr-squeezed  vacuum  for  an  improved  phase 
measurement.  We  are  presently  working  on 
scheme  2. 

The  theory  of  squeezing  requires  a  self-consistent 
analysis  of  nonlinear  optical  processes.  Every 
quantum  measurement  employs  a  nonlinear  device. 
With  a  self-consistent,  yet  simple  quantum  theory  of 
nonlinear  devices,  one  can  develop  a  self- 
consistent  theory  of  quantum  measurement.  Taking 
advantage  of  this  fact,  we  have  analyzed  some 
basic  quantum  measurements  in  which  we  have 
quantized  not  only  the  process  to  be  measured,  but 
also  the  measurement  equipment.  Paradoxes  of 
quantum  measurement,  some  of  long  standing,  can 
be  eliminated  in  this  way.'^ 


1.5  Integrated  Optics  Components: 

The  Channel  Dropping  Filter 
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The  Channel-Dropping  Filter  (CDF),  proposed  by 
Haus  and  Lai,’^  is  a  novel  narrow-band  integrated 
optical  filter  highly  suitable  for  wavelength-division 
demultiplexing  of  optically  transmitted  bitstreams. 
The  novel  architecture  of  the  CDF  allows  "in-line" 
filtering  of  a  WDM  bitstream.’^  This  is  in  contrast 


with  other  currently  proposed  filters,  all  of  which 
require  the  termination  of  the  entire  bitstream  to 
resolve  the  frequency  components.  Within  the  past 
year,  we  have  shown  the  application  of  the  CDF  for 
phase  control  of  optically  controlled  phased-array 
radars  and  wide  area  synchronous  clock  distribu¬ 
tion. 

The  CDF  is  an  integrated  optical  device  that  is  con¬ 
structed  with  three  parallel  waveguides  and  quarter- 
wave  shifted  distributed-feedback  (DFB)  gratings. 
The  center  guide,  the  bus,  carries  the  WDM 
bitstream.  On  the  two  side  guides,  the  receiver  and 
mirror,  spacially  coherent  DFB  gratings  are  pat¬ 
terned. 

Two  prototypes  of  the  CDF  are  being  developed  at 
MIT.  One  is  based  on  the  silica-on-silicon  materials 
system,  developed  in  the  Integrated  Circuits  Labo¬ 
ratory  at  MIT.  While  this  system  is  optically 
passive,  its  low  loss  will  allow  for  the  strong  optical 
resonances  required  for  proper  operation.  The 
other  is  based  on  the  InGaAsP/lnP  semiconductor 
materials  system,  developed  in  the  Photonic  Semi¬ 
conductors  Laboratory  in  collaboration  with  Pro¬ 
fessor  L.A.  Kolodziejski.  This  optically  active 
system  will  allow  for  optical  amplification  to  over¬ 
come  the  intrinsic  material  loss  as  well  as  allow 
tuning  of  the  filter  frequency.  In  both  cases,  x-rav 
nanolithography  is  necessary  to  pattern  the  5100  A 
and  2300  A  period  gratings  for  the  two  respective 
materials  systems.  The  x-ray  technology  has  been 
developed  in  the  Submicron  Structures  Laboratory 
in  collaboration  with  Professor  H.l.  Smith. 


1.6  Tunable  Semiconductor  Lasers 
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Wavelength  Division  Multiplexing  (WDM)  applica¬ 
tions  such  as  the  channel  dropping  filter  proposed 
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by  H.A.  Haus  and  Y.  Lai'®  require  tunable  lasers 
and/or  optical  amplifiers.  Simulations  we  performed 
have  shown  that  the  three  terminal  Independently 
Addressable  Asymmetric  Double  Quantum  Well 
(lAADQW)  structure  we  proposed  has  a  tuning 
range  of  more  than  20  nm  about  a  center  wave¬ 
length  of  1.5  nm.  This  is  significantly  larger  than 
the  continuous  tuning  range  of  both  three-section 
Distributed  Bragg  Reflector  (DBR)  and  Tunable 
Twin-Guide  (TTG)  lasers  that  are  competing  for  use 
in  WDM  systems.  In  addition  to  wide  tuning  range, 
the  lAADQW  is  a  three-terminal  device  so  light 
output  power  can  be  kept  constant  during  tuning. 

Fabrication  of  the  lAADQW  devices  has  begun 
using  a  self-aligned  technique  we  developed.  A 
single  photolithography  step  defines  a  metal  layer 
which  we  use  as: 

1.  An  etch  mask  for  the  definition  of  an  optical 
waveguide  using  Reactive  Ion  Etching  (RIE); 

2.  An  ion  implant  mask  that  we  use  to  confine  the 
current  injection  in  the  lower  quantum  well; 

3.  A  contact  to  the  upper  quantum  well. 

Use  of  this  technique  eliminates  critical  lithographic 
alignments  that  could  make  the  fabrication  of  three 
terminal  devices  prohibitively  complex. 

1.7  Nonlinear  Properties  of  InGaAsP 
Optical  Amplifiers 
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The  nonlinear  properties  of  semiconductor  optical 
amplifiers  are  of  interest  because  they  1)  influence 
the  modulation  response  and  mode  stability  of 
diode  lasers,  2)  limit  speed  and  produce  cross-talk 
between  multiplexed  signals  in  optical  amplifiers, 
and  3)  may  be  useful  for  the  design  of  nonlinear 
optical  waveguide  modulators  and  switches.  We 
use  femtosecond  optical  pulses,  tunable  near  1.5 
pm,  to  characterize  the  optical  nonlinearities  in  the 
gain  and  index  of  InG^sP  diodes  and  optical 
amplifiers. 

During  the  past  year,  we  measured  for  the  first  time 
the  femtosecond  gain  dynamics  of  strained-layer 
multiple-quantum-well  (SLMQW)  amplifiers  using 
the  novel  heterodyne  pump-probe  technique  we 
had  previously  developed.’®  The  heterodyne  tech¬ 
nique  is  essential  for  studying  SLMQW  devices 
because  their  gain  is  highly  anisotropic.  This 
anisotropy  is  due  to  the  polarization  selection  rules 
and  the  large  energy  separation  between  the  light- 
hole  and  heavy-hole  bands.  The  pump-probe 
results  show  a  response  consistent  with  spectral 
hole  burning  when  the  diode  is  biased  in  the 
absorption  regime.  In  the  gain  regime,  carriers  are 
heated  by  free-carrier  absorption,  and  there  is  a 
measurable  delay  (-  200  fs)  in  the  thermalization  of 
the  hot  carrier  distribution.  Subsequent  cooling  to 
the  lattice  temperature  follows  with  a  time  constant 
of  -  1  ps,  which  is  in  good  agreement  with  previous 
results  on  bulk  and  unstrained  MQW  InGaAsP 
optical  amplifiers. 

We  have  also  extended  the  heterodyne  pump-probe 
technique  to  measure  ultrafast  index  nonlinearities 
in  bulk  and  SLMQW  amplifiers.’^  For  these  exper¬ 
iments,  we  use  a  high-frequency  (ham)  radio 
receiver  (instead  of  a  spectrum  analyzer)  to  detect 
the  beat  between  reference  and  probe  pulses 
because  it  has  AM  and  FM  reception  at  the  same 
frequency.  This  new  technique  for  measuring  ultra¬ 
fast  index  nonlinearities  has  many  advantages  over 
time  division  interferometer  (TDI)  techniques.  It 


15  H.A.  Haus  and  Y.  Lai,  "Quantum  Theory  of  Soliton  Squeezing:  A  Linearized  Approach,"  J.  Opt.  Soc.  Am.  B  7:  386-392  (1990). 
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requires  no  stabilization  of  the  interferometer  arms, 
no  balanced  detector  to  eliminate  signals  due  to 
probe  amplitude  changes,  and  it  can  be  used  with 
either  copolarized  or  cross-polarized  pump-probe 
beams.  Our  results  suggest  that  index  nonlinear¬ 
ities  are  due  to  an  ultrafast  optical  Kerr  effect  as 
well  as  to  carrier  heating.  The  measured  02  in 
these  diodes  ranged  from  -  2  x  1 0- cm^A/V  to 
-  6  X  10' cm^AfV,  depending  on  the  pump-probe 
polarization  orientation  and  the  diode  active  struc¬ 
ture  (bulk  vs.  SLMQW).  These  are  the  first  pump- 
probe  measurements  of  index  nonlinearites  in 
InGaAsP  diodes.  Prelimmary  results  for  optical 
switching  applications  suggest  that  150  fs  pulses 
with  only  3  pj  of  pulse  energy  can  be  used  to 
induce  large  gain  (A  T/T  =  95  %)  and  index 
(AO  =  0.75:t)  changes  in  diodes  biased  at  the  trans¬ 
parency  point.  In  this  case,  the  gain  and  index 
nonlinearities  have  recovered  completely  after  5  ps. 

1.8  Ultrafast  Index  Nonlinearities  in 
Active  Waveguides 
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Active  optical  waveguides  exhibit  large  and  poten¬ 
tially  useful  ultrafast  nonlinearities.  They  are  attrac¬ 
tive  structures  for  use  in  applications  because  the 
carrier  density  in  the  active  region  is  easily  varied, 
and  thus,  the  magnitude  and  recovery  time  of  a 
given  nonlinearity  can  be  controlled.  For  example, 
ultrafast  index  of  refraction  nonlinearities  due  to 
nonequilibrium  carriers  could  become  the  basis  for 
a  controllable  all-optical  switch.  The  realization  of 
such  a  device  requires  knowledge  of  how  active 
waveguides  respond  to  optical  excitation. 

With  these  ideas  in  mind,  our  aim  for  this  project  is 
to  use  short  optical  pulses  to  investigate  and  char¬ 
acterize  the  gain  and  refractive  index  nonlinearities 


exhibited  by  active  waveguides.  The  waveguides 
we  study  consist  of  bulk  AIGaAs  diode  lasers 
(Hitachi  HLP1400)  with  anti-reflection  coatings 
deposited  on  the  facets.  Such  a  device  functions 
as  an  active  optical  waveguide  with  gain  provided 
by  bias  current  injected  into  the  device.  Our  pre¬ 
vious  pump-probe  studies  employed  a  dye  laser 
system  that  produced  optical  pulses  400 
femtoseconds  in  duration.  These  measurements 
revealed  that  transient  carrier  heating  and  the 
optical  Stark  effect  are  the  dominant  ultrafast 
refractive  index  nonlinearities  in  these  devices. 

We  recently  upgraded  our  source  to  a  modelocked 
Ti;sapphire  laser  system  that  produces  pulses  as 
short  as  70  femtoseconds.  The  short  pulses  and 
wide  tuning  range  afforded  by  this  new  laser 
system  have  enabled  us  to  perform  detailed  studies 
of  above-  and  below-band  gap  dynamics  in  active 
waveguides.’®  These  recent  studies  have  led  to 
further  interesting  findings  about  refractive  index 
nonlinearities.  In  a  series  of  pump-probe  measure¬ 
ments  conducted  with  pulses  tuned  below  the 
active  region's  band  gap  energy,  we  have  observed 
a  transient  caused  by  carrier  heating  with  a 
recovery  time  of  ~1 .3  picoseconds.  The  only 
method  by  which  carriers  can  be  heated  by  a 
below-band  photon  is  through  free  carrier  aborption. 
Thus,  our  results  indicate  that  significant  carrier 
heating  can  result  from  free  carrier  absorption.  In 
addition,  our  below-band  studies  have  also  shown 
that  there  is  a  delay  of  about  0.12  picoseconds  in 
the  turn-on  of  the  carrier  heating  dynamic.  This 
indicates  that  it  takes  some  finite  amount  of  time  for 
free  carriers  that  have  absorbed  photons  to  heat  up 
the  sea  of  carriers  sitting  at  the  bottom  of  the  band. 
We  have  also  observed,  in  above-  and  below-band 
measurements,  an  instantaneous  refractive  index 
transient  that  we  attribute  to  the  optical  Stark  effect. 
Our  measurements  show  that  this  instantaneous 
nonlinearity  experiences  a  resonant  enhancement 
as  the  pump  wavelength  is  tuned  toward  an  absorp¬ 
tion  edge.  Finally,  studies  of  above-ba.id  gain 
dynamics  have  revealed  evidence  for  spectral-hole 
burning. 

Our  experiments  reveal  a  great  deal  about  funda¬ 
mental  physical  processes  and  carrier  relaxation 
mechanisms  in  an  optically  excited  semiconductor. 
In  addition,  we  are  also  learning  about  potentially 
useful  nonlinearities  in  active  waveguides.  For 
instance,  transient  carrier  heating  and  the  optical 
Stark  effect  are  the  source  of  large,  ultrafast 
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electronics,  San  Francisco.  California.  January  1 993. 
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refractive  index  nonlinearities  that  may  be  useful  in 
the  development  of  practical  photonic  devices. 


1.9  Femtosecond  Pulse  Generation  and 
Amplification  in  F-center  Systems 

Sponsors 
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Project  Staff 

Professor  Erich  P.  Ippen,  Dr.  Jerome  M.  Paye,  Gadi 
Lenz,  Yuzo  Hirayama 

Modern  lightwave  communication  systems  use 
carrier  wavelengths  around  1.55  pm  because 
optical  fibers  have  minimum  loss  and  can  be  tai¬ 
lored  to  have  negligible  dispersion  in  this  spectral 
region.  The  characteristics  of  the  semiconductor 
lasers,  detectors,  optical  amplifiers,  and  modulators 
used  in  these  systems  depend  strongly  on  material 
properties  as  well  as  device  structure.  High  rates 
of  information  transmission  will  be  limited  in  some 
cases  by  the  nonlinearities  and  dynamic  behavior  of 
these  components.  At  the  same  time,  novel  proper¬ 
ties  of  these  1.5  pm  materials  and  structures  may 
make  new  device  architectures  possible.  To  inves¬ 
tigate  these  limits  and  new  possibilities,  we  have 
been  developing  a  unique  femtosecond  laser 
system.  Comprised  of  an  F-center  laser  oscillator 
and  high  power  amplifier,  it  will  provide  the  broadly 
tunable  source  and  continuum  spectroscopic  capa¬ 
bility  needed  for  such  studies. 

The  cw  femtosecond-pulse  oscillator  is  an 
NaChOH-  laser  with  a  center  wavelength  of  1.58 
pm  and  a  tuning  range  from  1.5  -  1.65  pm.  Pulses 
with  durations  of  about  100  fs  are  generated  using 
the  Additive  Pulse  Modelocking  (APM)  technique 
that  was  developed  in  our  lab.  With  this  particular 
laser,  we  have  also  discovered  a  self-starting  mode 
of  APM  that  exhibits  highly  stable  operation  and 
very  low  noise.  This  self-starting  under  cw  pumping 
is  particularly  useful  because  it  eliminates  the  noise 
associated  with  a  modelocked  pump  laser. 


Recently,  we  took  advantage  of  this  characteristic  in 
an  experiment  involving  the  generation  of  squeezed 
light.’® 

Typical  pulse  energy  at  the  output  of  our  laser  is  -1 
nJ.  This  is  sufficient  for  a  variety  of  pump-probe 
experiments,  but  for  many  experiments,  including 
continuum  generation,  higher  pulse  energies  are 
required.  This  year,  we  built  an  optical  amplifier  for 
the  purpose  of  reaching  these  energies.  We  have 
observed  a  gain  of  lO"  and  10  pJ  output  pulses 
using  this  system.  Our  amplifier  uses  a  long  NaCI: 
OH-  crystal  as  the  gain  medium.  The  crystal  is 
pumped  by  a  Q-switched  Nd:YAG  at  a  rate  of  1 
KHz.  The  amplified  pulse  passes  through  the 
amplifier  four  times  with  a  total  transit  time  still 
much  shorter  than  the  pump  pulse.  To  match 
oscillator  pulses  of  the  KHz  pump  repetition  rate, 
we  have  set  up  a  synchronized  electro-optic  pulse 
selection  system.  With  further  optimization  we 
expect  to  achieve  as  much  as  100  pJ  per  pulse. 
These  energies  will  allow  continuum  generation 
making  possible  new  time  resolved  spectroscopy 
experiments.  These  experiments  will  yield  new 
information  on  the  basic  physics  of  optical  devices. 


1.10  Coherent  Phonons  In  Electronic 
Materials 
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Very  large  amplitude  coherent  optical  phonons  can 
be  induced  in  semimetals  and  narrow-gap  semicon¬ 
ductors  and  observed  using  pulses  of  light  that  are 
short  in  duration  compared  to  the  characteristic 
periods  of  lattice  vibrations.  The  excitation  mech¬ 
anism  for  coherent  phonons  in  these  materials  is 
attributed  to  a  rapid  displacement  of  the  ion  equilib¬ 
rium  coordinate  induced  by  electronic  excitation.^® 
Our  current  interest  in  coherent  phonons  is  two-fold: 
(1)  using  time-resolved  femtosecond  spectroscopy 
to  further  our  understanding  of  lattice  dynamics  in 


'9  C.R.  Doerr,  I.  Lyubomirsky,  G.  Lenz,  J,  Paye,  H.A,  Haus,  and  M.  Shirasaki,  "Optical  Squeezing  with  a  Short  Fiber,"  paper  presented 
at  QELS  ’93,  Baltimore,  Maryland,  May  1993. 

20  T.K.  Cheng,  J.  Vidal,  H.J.  Zeiger,  E.P.  Ippen,  G.  Dresselhaus,  and  M.S.  Dresselhaus,  "Mechanism  for  Displacive  Excitation  of 
Coherent  Phonons,"  Appl.  Phys.  Lett.  59:  1923  (1991);  H.J.  Zeiger,  J.  Vidal,  T.K.  Cheng,  E.P.  Ippen,  G.  Dresselhaus  and  M.S. 
Dresselhaus,  "Theory  for  Displacive  Excitation  of  Coherent  Phonons,"  Phys.  Rev.  BAS:  768  (1992). 


Chapter  1.  Optics  and  Quantum  Electronics 


electronic  materials,  (2)  exploiting  the  coherent 
phonon  phenomena  to  modulate  the  physical  char¬ 
acteristics  of  the  material  at  THz  frequencies.  We 
are  carrying  out  this  work  in  collaboration  with  Pro¬ 
fessor  M.  Dresselhaus  and  her  group  in  the  Center 
for  Materials  Science  and  Engineering. 

Our  most  recent  experiments  involving  high  excita¬ 
tion  densities  have  shown  that  a  solid,  upon  optical 
pulse  excitation,  can  undergo  coherent  oscillation  at 
frequencies  which  are  shifted  significantly  (greater 
than  10  percent)  from  their  low  excitation  values. 
Furthermore,  we  have  studied  the  detailed 
dynamics  of  how  the  coherent  phonon  evolves  from 
its  initial  frequency-shifted  state  back  to  the  equilib¬ 
rium  state.  These  time-domain  measurements 
provide  information  inaccessible  via  other 
spectroscopic  techniques  and  reveal  important  non¬ 
equilibrium  behavior  of  coherent  phonons  related  to 
lattice  anharmonicity  and  carrier  screening. 

A  highlight  of  this  past  year  is  that  we  showed,  for 
the  first  time,  the  possibility  of  coupling  coherent 
phonon  dynamics  to  electron  dynamics  in  the  limit 
of  large  vibrational  amplitudes.^'  For  example,  we 
investigated  TijOa,  a  narrow  gap  semiconductor 
which  undergoes  a  semiconductor-semimetal  transi¬ 
tion  as  a  function  of  temperature.  On  the  basis  of 
x-ray,  Raman,  and  reflectivity  measurements,  we 
deduced  that  the  coherent  phonons  we  excite  with 
femtosecond  pulses  can  modulate  the  interatomic 
distances  in  Ti203  by  as  much  as  two  percent.  In 
principle,  a  three  percent  modulation  is  enough  to 
modulate  the  entire  semiconductor-semimetal  tran¬ 
sition.  Experimental  work  in  progress  is  directed  at 
further  developing  this  notion  of  modulating  the 
physical  characteristics  of  a  solid  at  THz  frequen¬ 
cies  via  optically  induced  coherent  phonons. 

1.11  Femtosecond  Studies  of 
Fullerenes 

Sponsors 
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Contract  DAAL03-92-C-0001 
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The  soccerball-shaped  molecule  Ceo  with  its  high 
symmetry  has  generated  considerable  theoretical 
as  well  as  experimental  research.  Some  of  this 
interest  has  been  inspired  by  the  discovery  of 
superconductivity  with  a  high  transition  temperature 
of  18  K  and  28  K  for  this  third  form  of  carbon  if 
doped  with  potassium  and  rubidium,  respectively. 

The  high  symmetry  (In)  of  Ceo  imposes  strong  selec¬ 
tion  rules  on  phonon  mediated  excited  states  transi¬ 
tions.  It  is  therefore  interesting  to  study  the 
processes  of  electronic  and  vibrational  energy 
decay  in  the  excited  singlet  and  triplet  states  in  Ceo 
versus  those  for  the  Cis  ions  in  doped  samples 
which  are  metallic  and  of  lower  symmetry.  A  funda¬ 
mental  question  is  whether  the  fullerenes  with  their 
unusual  structure  present  excited  state  features  that 
are  distinct  from  other  aromatic  molecules.  Optical 
pump-probe  experiments  allow  us  to  monitor  the 
dynamics  of  the  scattering  between  excited  states 
and  thus  give  information  about  the  electronic  struc¬ 
ture  as  well  as  the  strength  of  the  coupling  to 
phonons.  We  are  performing  such  studies  in  col¬ 
laboration  with  Professor  M.  Dresselhaus  and  with 
Professor  P.  Ecklund  of  the  University  of  Kentucky. 

The  undoped  (Ceo)  and  doped  (KsCeo.  RbaCeo)  films 
used  for  our  pump-probe  measurements  were  thin 
homogeneous  polycrystalline  films  deposited  on 
quartz  substrates  with  a  typical  film  thickness  of 
2000  A.  To  prevent  oxygen  contamination  all  the 
samples  were  sealed  off  in  helium.  Femtosecond 
dynamics  were  induced  and  observed,  in  reflectivity 
and  transmission  at  k=625  nm,  using  pulses  from  a 
colliding-pulse  modelocked  (CPM)  dye  laser.  Both 
the  K  and  Rb  doped  films  show  approximately  the 
same  fast  decay  on  the  order  of  0.2  ps.  This  is 
much  faster  than  the  monitored  response  of  Ceo. 
which  exhibits  a  decay  rate  on  the  order  of  30  ps, 
with  a  fast  initial  component  on  the  order  of  1  ps.^^ 

The  difference  in  these  decay  rates  reflects  the 
molecular  character  of  Ceo  and  the  band-like  char¬ 
acter  of  MaCeo  (M=K,  Rb).  We  attribute  the  meas¬ 
ured  nonexponential  decay  of  the  Ceo  films  to  the 
relaxation  of  excited  state  carriers  from  their  initial 
singlet  states  to  lower  lying  excited  triplet  states. 
This  is  a  relatively  slow  process  because  the  scat¬ 
tering  is  only  on-ball  via  coupling  to  vibrational 


2'  T.K.  Cheng,  L,  Acioli,  E.P.  Ippen,  J.  Vidal,  H.J.  Zeiger,  G.  Dresselhaus,  and  M.S.  Dresselhaus,  "Modulation  of  a  Semiconductor-fo- 
Semimetal  Transition  at  Seven  Teraherz  via  Coherent  Lattice  Vibrations,"  Appl.  Phys.  Lett.,  forthcoming. 

22  S  B.  Fleisher,  E.P.  Ippen,  G.  Dresselhaus,  M.S.  Dresselhaus,  A.M.  Rao,  P.  Zhou,  and  P.C.  Ecklund,  "Femtosecond  Optical  Dynamics 
of  MjCw,"  Appl.  Phys.  Lett,  forthcoming 


78  RLE  Progress  Report  Number  135 


Chapter  1 .  Optics  and  Quantum  Electronics 


modes.  For  the  doped  samples  the  half  filled 
LUMO-derived  conduction  band  gives  rise  to  parity 
allowed  pump-induced  transitions  of  electrons  to 
even  higher  lying  bands.  KaCeo  and  RbaCeo  are 
metallic  and  therefore  exhibit  properties  related  to 
narrow  bands  rather  than  vibrationally  broadened 
molecular  states  (as  in  the  case  of  Ceo)-  Subse¬ 
quent  to  their  optical  excitation,  the  carriers  created 
in  both  the  initial  and  final  states  undergo  thermal 
equilibration  within  each  band  via  Coulomb  scat¬ 
tering  and  also  on  a  slower  or  competing  time  scale 
via  electron-phonon  coupling.  Both  processes  also 
scatter  excited-state  electrons  out  of  their  initial 
states  into  states  with  different  symmetry  and  thus 
modulates  the  absorption  cross  section.  For  a 
more  detailed  understanding  of  the  electronic  struc¬ 
ture  and  the  scattering  mechanisms  in  these  mate¬ 
rials,  wavelength  selective  pump-probe 
investigations  will  be  performed. 


1.12  Ultrashort  Pulse  Generation  in 
Solid  State  Lasers 
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1.12.1  Fundamental  Limits  to  Ultrashort 
Pulse  Generation 

The  development  of  techniques  for  the  generation 
of  ultrashort  laser  pulses  is  essential  for  studies  of 
ultrafast  pheonomena  as  well  as  high  speed  optical 
communications  and  signal  processing  applications. 
The  problem  of  exploring  the  limits  of  short  pulse 
modelocked  laser  performance  is  an  important 


issue  in  femtosecond  optics.  The  time  resolution  of 
ultrafast  measurements  is  limited  only  by  the 
shortest  laser  pulse  durations  than  can  be  gener¬ 
ated.  In  addition,  the  bandwidths  available  from 
short  pulses  make  possible  the  use  of  spectral 
shaping  and  synthesis  techniques. 

Recent  advances  have  made  it  possible  to  generate 
pulse  durations  of  less  than  1 00  fs  using  third  gen¬ 
eration  solid  state  laser  technology.  Solid  state 
lasers  feature  high  energy  storage  and  high  power 
operation  and  can  be  engineered  into  a  compact 
and  relatively  low  cost  short  pulse  technology.  The 
next  milestone  in  ultrafast  phenomena  will  be  the 
development  of  solid  state  laser  systems  which  can 
reliably  generate  pulse  durations  of  a  few 
femtoseconds.  Advancing  the  performance  of  solid 
state  laser  technology  will  require  fundamental 
studies  of  pulse  generation  and  pulse  shaping 
mechanisms  as  well  as  the  development  of  new 
pulse  generation  techniques.  During  the  past  year 
our  research  has  focussed  on  exploring  the  funda¬ 
mental  limits  for  short  pulse  generation  in  solid 
state  lasers  and  developing  techniques  which  have 
the  potential  for  generating  pulse  durations  of  less 
than  1 0  fs. 

The  fundamental  issue  that  underlies  the  develop¬ 
ment  of  sub  10  fs  lasers  is  how  can  one  acheive 
the  minimum  possible  pulsewidth  and  utilize  the  full 
bandwidth  that  is  available  from  a  given  laser  gain 
media.  TiiAIzOs  is  an  especially  attractive  material 
for  the  study  of  femtosecond  pulse  generation 
because  of  its  large  gain  bandwidth.  The  tuning 
range  of  TiiAIzOs  is  from  670  nm  to  greater  than 
1000  nm.  From  the  energy  time  uncertainty 
relationship,  the  pulsewidth  scales  inversely  with 
the  bandwidth  of  the  medium.  More  specifically, 
the  analytic  theory  of  Additive  Pulse  Modelocking 
(APM)  as  developed  by  Haus  et  al.^^  predicts  that 
the  shortest  pulse  scales  inversely  with  the  gain 
bandwidth  multiplied  by  the  square  root  of  the  satu¬ 
rable  absorption.  Assuming  a  saturable  absorber 
loss  modulation  of  1  percent,  the  bandwidth  of 
Ti:Al203  should  support  pulses  of  10  fs.  In  addition, 
the  use  of  intracavity  self-phase-modulation  and 
negative  group  velocity  dispersion  can  further 
reduce  the  pulse  width  by  a  factor  of  three.^^  Thus, 
in  theory,  pulses  of  3  fs  should  be  acheivable 
directly  from  a  TiiAIzOs  laser. 


23  H.A.  Haus,  J.G.  Fujimoto,  and  E.P.  Ippen,  "Structures  for  Additive  Pulse  Modelocking,"  J.  Opt.  Soc.  Am.  B  8:  2068-2076  (1991). 
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1.12.2  Short  Pulse  Generation  using 
intracavity  Dispersion  Compensation 

One  central  issue  in  the  generation  of  extremely 
short  pulse  durations  is  the  control  of  very  broad 
bandwidth  optical  signals.  The  spectral  width  of  a 
laser  pulse  is  inversely  proportional  to  its  duration; 
in  the  10  fs  regime  for  a  pulse  in  the  near  infrared, 
the  pulse  bandwidth  will  be  more  than  60  nm  or 
almost  ten  percent  of  the  carrier  frequency.  One 
approach  for  optimizing  laser  performance  is  to 
compensate  intracavity  dispersion.  Previous  results 
in  pulse  compression  studies^"*  and  in  laser 
modelocking^s  have  shown  that  control  of  higher 
order  dispersion  is  essential  in  short  pulse  genera¬ 
tion.  Second  order,  or  group  velocity,  dispersion 
may  be  compensated  with  a  pair  of  either  prisms  or 
gratings.  The  low  loss  of  prisms  inserted  at 
Brewster's  angle  has  allowed  their  use  inside  laser 
cavities;  they  are  essential  components  of  both  col¬ 
liding  pulse  modelocked  dye  lasers  and  Kerr  lens 
mode  locked  solid  state  lasers.  While  the  dominant 
effect  of  both  prism  and  grating  pairs  is  to  correct 
second  order  dispersion,  they  simultaneously  intro¬ 
duce  third  order  effects.  Luckily,  the  sign  of  the 
third  order  contribution  of  gratings  is  opposite  to 
that  of  prisms.  A  combination  of  gratings  and 
prisms  was  used  in  an  extracavity  pulse  com¬ 
pression  experiment  to  produce  6  fs  pulses,  the 
current  world  record.^" 

Working  in  collaboration  with  CVI  Inc.,  we  have 
designed  and  built  a  thin  film  optical  element  that 
introduces  only  third  order  dispersion  and  has  low 
loss  necessary  for  intracavity  use.  Our  device,  a 
Gires-Tournois  interferometer  (GTI),  is  fabricated  on 
a  high  reflecting  mirror  substrate  and  permits  the 
independent  control  second  and  third  order  intra¬ 
cavity  dispersion.  The  amount  of  third  order  disper¬ 
sion  may  be  selected  by  varying  the  number  of 
bounces  of  the  laser  beam  off  the  GTI.  This  tech¬ 
nique  has  allowed  us  to  reduce  the  pulse  duration 
in  a  Kerr  lens  modelocked  TiiAIzOs  laser  from  45  fs 
to  28  fs.  We  have  performed  detailed  studies 
which  characterize  the  intracavity  dispersion  in  the 
TiiAIzOs  laser  in  order  to  demonstrate  the  role  of 


third  order  dispersion  compensation  for  the  opti¬ 
mization  of  femtosecond  laser  performance.^® 

Approaches  for  optimizing  pulse  duration  perfor¬ 
mance  have  emerged  as  an  active  area  of  research 
in  recent  months.  Since  our  demonstration  of  the 
GTI  as  a  third  order  dispersion  compensating 
device,  a  number  of  other  approaches  for  control¬ 
ling  dispersion  have  also  been  investigated  by  us 
as  well  as  other  research  groups.  These  tech¬ 
niques  include  the  use  of  specialized  glasses  for 
prisms  and  multiple  prism  schemes.  The  current 
record  for  pulsewidth  using  this  approach  is  1 1  fs.^^ 
Further  improvements  in  tne  pulse  duration  gener¬ 
ated  directly  from  a  laser  will  probably  require  other 
pulse  compression  and  shaping  techniques  as  well 
as  laser  intracavity  dispersion  compensation. 

In  addition  to  optimizing  laser  performance,  we  are 
also  investigating  alternative  strategies  for  gener¬ 
ating  short  pulses.  Our  studies  focus  on  exploiting 
the  short  pulse  durations  and  high  peak  intensity  of 
the  pulses  for  external  pulse  compression.  We  are 
pursuing  two  approaches  to  achieve  this  com¬ 
pression  and  study  its  limits.  We  have  recently  built 
a  Kerr  lens  modelocked  Ti:Alz03  laser  which  uses 
minimized  third  order  dispersion  to  generate  pulse 
durations  of  less  than  30  fs.  The  high  peak  ener¬ 
gies  and  short  pulsewidths  of  these  pulses  suggest 
that  good  compression  factors  may  be  achieved 
using  using  optical  fiber  pulse  compression  tech¬ 
niques  to  attain  durations  of  10  fs  or  less  with  pulse 
energies  of  1  to  2  nj  at  repetion  rates  of  100  Mhz. 
Investigation  of  these  pulse  compression  tech¬ 
niques  and  factors  which  govern  their  performance 
are  currently  undenway. 


1.12.3  Cavity  Dumped  Femtosecond  Laser 
Sources 

Another  complementary  approach  for  short  pulse 
generation  is  the  development  and  investigation  of 
techniques  for  cavity  dumping  modelocked 
femtosecond  solid  state  lasers.  These  studies  are 
important  because  they  provide  increased  pulse 
energies  without  using  amplification.  These 
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increased  pulse  energies  from  this  laser  will  make  it 
ideal  for  pulse  compression  as  well  as  for 
femtosecond  studies  of  nonlinear  processes. 

In  preliminary  studies  using  the  Kerr  lens 
modelocked  TiiAlaOa  laser  we  have  demonstrated 
cavity  dumping  using  an  intracavity  acousto-optic 
Bragg  cell.  Pulse  durations  as  short  as  50  fs  with 
too  nJ  pulse  energies  were  generated  at  a  900 
KHz  repetition  rate.  In  addition  to  the  generation  of 
high  energy  short  pulses,  cavity  dumping  provides 
an  approach  for  studying  the  pulse  formation 
dynamics  in  the  laser.  One  of  the  surprising 
findings  of  our  preliminary  studies  was  that  rela¬ 
tively  high  dumping  rates  can  be  achieved  without 
significant  disruption  of  the  modelocking  and  pulse 
formation  process. 

Further  studies  are  currently  undenway  to  extend 
these  techniques  to  yield  pulse  durations  of  less 
than  30  fs  with  higher  pulse  energies.  The 
development  of  a  cavity  dumped  Kerr  lens 
modelocked  TiiAIzOa  laser  is  significant  because  it 
represents  a  simpler  and  lower  cost  alternative 
approach  to  amplification  for  achieving  increased 
pulse  energies. 


1.13  Studies  of  Ultrafast  Phenomena  in 
Materials 

Sponsors 

Joint  Services  Electronics  Program 
Contract  DAAL03-91-C-0001 
National  Science  Foundation 
Fellowship  ECS-85-52701 
U.S.  Air  Force  Office  of  Scientific  Research 
Contract  F49620-91-C-0091 
U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-91-J-1956 

Project  Staff 

Malini  Ramaswamy,  Chi-Kuang  Sun,  Dr.  Lucio  H. 
Acioli,  Dr.  Fabrice  Vallee,  Professor  Erich  P.  Ippen, 
Professor  James  G.  Fujimoto 


1.13.1  Femtosecond  Carrier  Dynamics  in 
AIGaAs 

The  design  and  optimization  of  high  speed  elec¬ 
tronic  and  optoelectronic  devices  depends  on  know¬ 
ledge  of  the  dynamics  of  transient  electron  and  hole 
populations  in  semiconductor  materials.  Carrier 
dynamics  determine  the  fundamental  limits  of 
device  speed.  For  example,  intervalley  electron 
scattering  is  largely  responsible  for  the  high  field 
transport  properties  of  GaAs  and  AIGaAs.  For  ideal 
device  design  it  should  be  possible  to  predict  the 
time  and  energy  evolution  of  a  excited  carrier  dis¬ 
tribution  in  the  bandstructure  of  a  semiconductor. 
Electron-electron  scattering  times  are  as  short  as 
several  femtoseconds.  Therefore,  an  accurate 
model  of  carrier  behavior  must  have  the  ability  to 
predict  transient  effects  on  the  femtosecond  time 
scale. 

Working  in  collaboration  with  solid  state  theoretical 
physicists  at  the  University  of  Florida,  we  have 
developed  a  collaborative  research  program  which 
combines  state  of  the  art  experimental  and  theore¬ 
tical  techniques  to  perform  detailed  investigations  of 
carrier  dynamics  in  GaAs  and  AIGaAs  on  a 
femtosecond  scale.  Using  femtosecond  laser 
experimental  studies  at  MIT  and  ensemble  Monte 
Carlo  computer  simulations  at  the  University  of 
Florida,  we  have  been  able  to  experimentally 
measure  and  theoretically  model  the  various  scat¬ 
tering  processes  and  relaxation  mechanisms  of 
excited  carrier  distributions.  The  result  of  our  work 
is  a  model  that  may  be  extended  to  predict  tran¬ 
sient  carrier  dynamics  in  a  variety  of  different  exci¬ 
tation  conditions  in  semiconductors. 

Last  year,  we  performed  a  series  of  experimental 
studies  on  AIGaAs  using  a  tunable  femtosecond 
laser  system.^®  By  systematic  variation  of  the  wave¬ 
length  and  spectrum  of  the  femtosecond  laser 
pulses,  the  evolution  of  optically  excited  carrier  dis¬ 
tributions  initially  generated  in  different  regions  of 
the  band  can  be  systematically  investigated.  The 
laser  system  used  for  these  investigations  is  an 
amplified  colliding  pulse  modelocked  (CPM)  laser. 
Optical  pulses  as  short  as  35  femtoseconds  from 
the  CPM  are  amplified  to  microjoule  energies  in  a 
two  stage  dye  amplifier  pumped  by  a  copper  vapor 
laser.  The  amplified  pulses  are  then  focused  on  a 
flowing  jet  of  ethylene  glycol  to  produce  a 
femtosecond  wnite  light  continuum.  A  Fourier  fre¬ 
quency  filter  is  used  to  select  the  desired  spectrum 
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from  the  continuum.^®  Fourier  synthesized  pulses 
may  be  generated  with  arbitrary  spectral  or 
temporal  profiles:  in  our  experiment  we  are  primarily 
concerned  with  wavelength  and  bandwidth  since 
they  determine  the  initial  state  of  carriers  excited  to 
the  conduction  band. 

Using  this  laser  source,  pump  probe  studies  were 
performed  on  AIGaAs  samples.  The  mole  fraction 
of  Al  in  the  AIGaAs  sample  was  chosen  to  vary  the 
bandstructure  and  isolate  specific  optical  transitions. 
The  result  of  the  experimental  part  of  our  investi¬ 
gation  is  comprehensive  set  of  pump  prooe  differ¬ 
ential  transmission  traces.  The  data  span  a  range 
of  wavelengths  and  excited  carrier  densities,  and 
are  recorded  for  the  first  thousand  femtoseconds 
after  initial  excitation. 

In  another  study  we  investigated  excited  carrier 
dynamics  in  the  presence  of  a  cold  carrier  plasma.’® 
This  situation  is  analogous  to  the  conditions  found 
in  a  diode  laser.  We  developed  a  novel  three  pulse 
pump  probe  technique  to  investigate  the  transient 
carrier  dynamics  after  the  generation  of  carriers 
with  a  pre-pulse.  The  first  pulse  excites  carriers 
which  relax  to  a  quasiequilibrium,  cold  distribution 
before  the  pump  probe  measurement  takes  place. 
In  this  experiment  we  varied  the  bandstructure  of 
the  sample  instead  of  tuning  the  femtosecond  laser. 
With  this  technique  we  investigated  effects  above 
and  below  the  cold  plasma  Fermi  level  and  the  role 
of  intervalley  scattering.  The  presence  of  the  cold 
plasma  induces  only  a  small  increase  in  the  hot 
carrier  thermalization  rate  even  for  high  cold  plasma 
densities.  In  fact,  the  thermalization  of  a  low 
density,  hot  carrier  distribution  interacting  with  a 
cold,  high  density  photoexcited  plasma  occurs  on  a 
time  scale  longer  than  intervalley  scattering  in 
GaAs. 

While  the  femtosecond  pump  probe  technique  pro¬ 
vides  the  highest  temporal  resolution  available 
today,  the  interpretation  of  the  experimental  results 


is  often  complex.  We  use  state  of  the  art  computer 
simulations  for  two  purposes:  1)  to  keep  track  of  the 
large  number  of  optical  transitions  and  scattering 
channels  in  semiconductor  experiments,  and  2)  to 
deduce  fundamental  physical  information,  such  as 
the  time  dependent  carrier  distributions,  from  differ¬ 
ential  transmission  data.  Simple  analytical  models 
provide  physical  insight,  but  do  not  yield  detailed 
correspondence  with  experimental  results.  Our  the¬ 
oretical  model  begins  with  a  30  band,  full  zone  k  • 
p  calculation  of  the  semiconductor  bandstructure. 
Anisotropy  and  hole  warping  effects  are  included. 
An  ensemble  Monte  Carlo  .simulation  of  40,000 
electrons  and  noles  tracks  the  time  development  of 
the  distribi'iion  functions.®'  The  final  step  is  a  calcu¬ 
lation  of  ihe  differential  transmission  which  is  then 
compared  directly  with  the  experimental  traces. 
Fundamental  parameters  such  as  the  deformation 
potential  constant  and  carrier  scattering  rates  are 
then  deduced.  A  further  advantage  of  the  simu¬ 
lation  technique  is  that  the  strengths  of  different 
scattering  channels  may  be  artificially  varied  and 
the  sensitivity  of  the  data  to  specific  processes  may 
be  assessed. 


1.13.2  Nonequilibrium  Electronic  Effects  in 
Metals 

Studies  of  interactions  of  free  carriers  between 
themselves  and  their  envi'-onment  constitute  one  of 
the  major  problems  of  solid  state  physics.  This  has 
been  addressed  directly  in  the  time  domain 
employing  femtosecond  techniques  both  in  semi¬ 
conductors  and  metals.  Metals  are  especially  inter¬ 
esting  because  they  have  a  very  high  electron 
density  and  their  behavior  can  be  modeled  rela¬ 
tively  simply.  The  possibility  of  creating  and 
probing  a  transient  nonequilibrium  electron  popu¬ 
lation  in  metal  with  ultrashort  laser  pulses  has  been 
demonstrated  by  different  groups.®®  In  previous 
experiments  electron-electron  interactions  were 
assumed  to  be  sufficiently  fast  to  thermalize  the 
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electron  gas  on  a  time  scale  of  the  order  or  shorter 
than  the  laser  pulse  duration,  although  some  devi¬ 
ations  from  an  instantaneous  response  were 
observed.33  Recent  investigations  using  transient 
photoemission  have  demonstrated  the  existence  of 
non-Fermi  electron  distributions  with  thermalization 
times  as  long  as  600  fs.  These  results  were 
observed  in  gold  film  for  large  changes  of  the  elec¬ 
tron  temperature  (of  the  order  of  400°K)  with  a 
limited  time  resolution.^-*  Similar  conclusions  were 
drawn  at  lower  laser  fluence  by  analyzing  the  tem¬ 
perature  dependence  of  the  optically  measured 
electron-phonon  interaction  time  in  gold  and  silver.^ 

In  order  to  analyze  the  effect  of  the  non- 
instantaneous  electron-electron  interaction  on  the 
optical  response  of  a  metal  film,  recently  we  have 
performed  transient  reflectivity  and  transmissivity 
measurements  of  electron  temperature  dynamics.^® 
Studies  were  performed  in  the  perturbative  regime 
where  the  measured  changes  in  reflectivity  and 
transmissivity  can  be  directly  connected  to  the  elec¬ 
tron  distribution.  In  contrast  to  previous  exper¬ 
iments,  we  use  a  multiple  wavelength  femtosecond 
pump-probe  technique  to  excite  the  electron  gas 
with  a  infrared  pulse  (from  a  mode-locked  TirAIzOs 
laser  with  120  fs  pulsewidth  and  880-1065  nm 
wavelength)  and  probe  changes  in  optical  proper¬ 
ties  using  frequency  doubled  wavelength  in  the 
visible  (440-532.5  nm).  This  technique  permits  a 
more  definitive  measurement  of  electron  dynamics 
by  separating  the  effects  of  pump  induced  transi¬ 
tions  from  those  monitored  by  the  probe.  Exper¬ 
iments  were  performed  on  thin  gold  films  because 
the  band  structure  of  gold  is  relatively  well  known 
and  band  to  band  transition  can  be  easily  probed 
by  a  frequency  doubled  TiiAIzOa  femtosecond 
laser. 


Measurements  show  evidence  for  non-Fermi  elec¬ 
tron  distributions  with  a  election  thermalization  time 
of  -500  fs  and  an  electron-lattice  cooling  time  of  1 
ps,  independent  of  the  laser  fluence  in  the  range  of 
2.5  mJ/cm2  -  200  mJ/cm^  (corresponding  to  esti¬ 
mated  peak  temperature  change  3-200°K).3®  A 
simple  model  based  on  carrier  population  changes 
and  curved  d-band  was  developed  to  describe  the 
behavior  of  the  system.  The  diffusioit  in  the  opti¬ 
cally  thin  sample  is  neglected  and  the  e'ectron  dis¬ 
tribution  change  is  simply  seperated  to  ♦  vo  parts: 
Fermi  and  non-Fermi,  r-iupled  by  an  electron  gas 
thermalization  term.  Excellent  qualitative  corre¬ 
spondence  between  theory  and  experiment  was 
obtained. 

Our  measured  electron  thermalization  time  is  com¬ 
parable  to  the  previous  estimations.®^  On  the  oas’s 
of  the  very  high  electron  density  of  metals,  one 
would  expect  very  fast  electron-electron  inter¬ 
actions.  However,  the  interaction  efficiency  is  con¬ 
siderably  reduced  by  phase  space  filling  which 
blocks  most  of  the  energetically  possible  interaction 
channels  and  by  screening  which  considerably 
reduces  the  efficiency  of  the  Coulombic  interaction. 
Our  results  are  significant  because  they  provide 
some  of  the  first  detailed  information  on  non-Fermi 
electron  dynamics  in  metals  and  demonstrate  new 
experimental  techniques  for  their  investigation. 

1.14  Time  Domain  Diagnostics  of 
Waveguide  Devices 
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1.14.1  Studies  of  Gain  Dynamics  in 
Strained  Layer  Diodes 

Nonlinear  gain  and  transient  carrier  dynamics  in 
diode  lasers  are  important  because  they  influence 
laser  linewidth,  modulation  bandwidth,  amplication, 
and  short  pulse  generation.  Previous  investigators 
have  performed  picosecond  and  femtosecond  pump 
probe  measurements  of  nonlinear  gain  dynamics  in 
bulk  GaAs,3®  InGaAsP  MQW,^^  and 
InGaAs/InGaAsP  strained-layer  MOW  amplifiers.^® 
These  studies  have  shown  that  nonequilibrium 
carrier  temperature  effects  play  an  important  role  in 
carrier  dynamics.  Carrier  temperature  changes  are 
caused  by  a  number  of  processes  including  state 
filling  produced  by  stimulated  transitions,  free 
carrier  absorption,  and  two  photon  absorption.  Pre¬ 
vious  studies"®  have  shown  that  free  carrier  absorp¬ 
tion  play  a  dominant  role  in  carrier  heating.  A  ~ 
150  fs  relaxation  time  had  been  observed  which 
was  attributed  to  either  spectral  hole  burning  or  turn 
on  delay  of  free  carrier  heating.  In  addition  to  non¬ 
linear  gain,  carrier  dynamics  also  produce  nonlinear 
index  effects  such  as  self  phase  modulation."' 


Recently  we  invented  a  new  technique  for  per¬ 
forming  independent  multiple  wavelength  pump- 
probe  measurement  in  waveguide  devices."^  The 
output  of  a  Kerr  lens  mode-locked  TirAIzOs  laser 
was  coupled  to  an  optical  fiber  to  produce  a  self 
phase  modulation  broadened  spectral  bandwidth  - 
50  nm  FWHM.  After  the  fiber,  the  beam  was  split 
into  a  pump  and  a  probe,  which  were  directed  into 
two  spectral  windowing  assemblies"®  to  produce 
synchronous  independent  different-frequency 
pulses.  This  approach  represents  a  powerful 
method  for  performing  experimental  studies  since 
the  pump  and  probe  wavelengths  can  be  varied 
independently  and  thus  energy  relaxation  dynamics 
can  be  explicitly  measured. 

Working  in  collaboration  with  investigators  at  MIT 
Lincoln  Laboratory,  we  have  performed  the  first 
studies  of  carrier  dynamics  in  InGaAs/GaAs 
strained  layer  quantum  well  diode  lasers.  Strained 
layer  quantum  well  devices  represent  one  of  the 
most  active  and  technologically  promising  areas  of 
current  optoelectronics  device  research.  Strained 
layer  materials  provide  an  added  degree  of  freedom 
by  allowing  the  epitaxial  growth  of  nonlattice- 
matched  materials.  High  power,  high  efficiency, 
long  lifetime,  and  low  threshold  current  density"" 
semiconductor  lasers  have  been  achieved  in 
InGaAs  strained  layer  devices. 

Our  studies  were  performed  using  a  broad-area 
InGaAs/AIGaAs  graded-index  separate-confinement 
heterostructure  single  quantum  well  (GRIN-SCH 
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SOW)  ridge  waveguide  diode  laser"*®  with  a 
bandgap  near  950  nm.  Carrier  heating  was  studied 
by  tuning  the  pump  pulse  above  and  below  the 
bandgap  while  keeping  all  the  other  conditions 
(including  the  probe)  constant.  The  high  sensitivity 
of  the  system  allowed  us  to  perform  the  experiment 
in  the  low  perturbation  regime.  Different  heating 
and  cooling  times  were  measured  which  reflect  dif¬ 
ferent  thermalization  mechanisms  for  above  and 
below  band  pumping.  This  result  shows  that  free 
carrier  absorption  is  not  always  the  dominant  carrier 
heating  process  and  stimulated  transition  induced 
heating  plays  an  important  role  in  the  carrier 
heating  process,  at  least  in  strained  InGaAs/AIGaAs 
systems."*®  These  studies  represent  the  first  pump- 
probe  measurement  in  InGaAs/AIGaAs  strained 
layer  systems.  This  is  also  the  first  multiple  wave¬ 
length  time-domain  diagnostics  in  waveguide  struc¬ 
tures. 

Femtosecond  gain  dynamics  was  also  investigated 
using  multiple  wavelength  techniques.^^  Instead  of 
shifting  the  pump  pulse,  the  bias  current  was  varied 
so  that  gain,  transparency,  and  absorption  was  pro¬ 
duced  at  the  pump  wavelength.  The  probe  wave¬ 
length  was  in  the  gain  region  for  all  the  cases. 
Carrier  temperature  changes  mediated  by  both  free- 
carrier  absorption  and  stimulated  transitions  were 
observed.  Stimulated  transition  induced  carrier 
cooling  was  observed  in  the  absorption  region. 
This  was  the  first  observation  of  carrier  cooling  in 
GaAs  based  devices.  An  increased  understanding 
of  the  physical  mechanisms  of  gain  dynamics  from 
carrier  temperature  changes  is  important  for  the 
design  of  new  devices.  In  particular,  the  reduction 
of  carrier  heating  has  important  implications  for 
reducing  parasitic  gain  saturation  effects  in  short 
pulse  modelocked  laser  diodes  and  amplifiers. 


1.15  Laser  Medicine 

Sponsors 

National  Institutes  of  Health 
Grant  NIH-5-R01-GM35459-08 
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Project  Staff 
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1.15.1  The  Ultrashort  Pulse  Scalpel 

The  objective  of  our  program  is  to  continue 
development  of  an  optimized  ultrashort  pulse  laser 
scalpel  for  precise  intraocular  microsurgery.  Over 
the  last  ten  years,  short  optical  pulses  have  been 
successfully  applied  for  non-invasive  cutting  of 
intraocular  structures,  for  example  in  posterior 
capsulotomy  and  iridotomy.  Through  the  mech¬ 
anism  of  laser  induced  breakdown,  photodisruption 
or  cutting  of  intraocular  structures  is  made  possible 
without  the  need  for  intervening  surgical  incision.  To 
date,  the  majority  of  clinical  applications  of  laser 
induced  optical  breakdown  have  utilized 
nanosecond  pulses  in  the  millijoule  energy  range 
and  single  pulse  exposures.^  However,  mechanical 
side  effects  of  laser  induced  breakdown  with 
nanosecond  sources  pose  potential  hazards  to 
adjacent  ocular  structures  and  tissues.  Our  investi¬ 
gations  have  demonstrated  that  significant  reduction 
of  collateral  tissue  damage  may  be  achieved 
through  the  use  of  ultrashort  pulses.  These  studies 
are  part  of  an  ongoing  collaboration  between  inves¬ 
tigators  at  MIT,  the  New  England  Eye  Center  of 
New  England  Medical  Center  Hospitals,  and  the 
Wellman  Laboratories  of  Photomedicine  at  Mass¬ 
achusetts  General  Hospital. 

The  physical  processes  which  occur  in  laser 
induced  breakdown  include  plasma  formation, 
acoustic  wave  generation,  and  cavitation.  We  have 
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Studied  and  compared  the  mechanisms,  scaling 
behavior,  and  tissue  effects  of  single  pulses  in  the 
nanosecond  and  picosecond  ranges/^  In  general, 
nanosecond  and  picosecond  optical  breakdown 
results  in  comparable  damage  zones  if  the  same 
amount  of  energy  is  deposited;  however,  the 
threshold  energy  for  breakdown  is  much  lower  for 
picosecond  pulses,  and  near-  threshold  picosecond 
pulses  produce  greatly  reduced  collateral  damage 
zones.  For  example,  we  have  demonstrated  collat¬ 
eral  damage  ranges  in  a  corneal  endothelial  ceil 
model  of  only  100  microns  with  40  picosecond 
duration  pulses  at  8  microjoules  pulse  energy.  We 
have  also  studied  tissue  effects  (corneal  incisions) 
into  the  femtosecond  domain.®®  Ultrashort  pulses 
with  high  peak  intensities  can  produce  plasma- 
mediated  ablation  of  transparent  tissues,  such  as 
the  cornea.  Picosecond  and  femtosecond  pulse 
durations  have  been  demonstrated  to  produce 
much  smoother  excision  edges  and  less  damage  to 
the  adjacent  tissue  than  nanosecond  pulses. 

Following  these  initial  studies,  we  have  developed  a 
clinically  viable  picosecond  laser  scalpel  based  on  a 
modelocked,  Q-switched  Nd;YAG  laser  with 
external  pulse  selection.  This  laser  delivers  single 
100  picosecond  duration  pulses  at  a  repetition  rate 
variable  from  3  to  1000  Hz;  each  pulse  produces 
minimal  collateral  damage,  while  multiple  pulses 
produce  a  cumulative  incision  effect.  With  this 
laser,  we  achieve  optical  breakdown  in  the  deep 
vitreous  with  less  than  70  microjoules  pulse  energy, 
compared  to  millijoules  used  in  current  clinical 
Q-switched  systems.  We  have  performed  in  vitro 
and  in  vivo  studies  of  vitreous  membrane  surgery 
with  the  high  repetition  rate  picosecond  laser. 
Membrane  surgery  of  the  deep  vitreous  is  a  very 
challenging  procedure  because  of  the  proximity  of 
the  retina  and  the  sensitivity  of  the  retina  to  thermal 
and/or  mechanical  injury.  Using  an  in  vitro  vitreous 
membrane  model,  we  have  demonstrated  that 
repetitive  picosecond  pulses  can  produce  a  fine 
linear  incision  with  -  100  micron  cut  width. 
Mechanical  disruption  posterior  to  the  laser  focus  is 
also  confined  to  less  than  200  microns.  We  have 
also  performed  picosecond  laser  microsurgery  in 
vivo  in  experimental  vitreous  membranes  in  rabbit 
eyes.  Cuts  were  successfully  made  through  mem¬ 


branes  at  distances  varying  from  <  100  microns  to 
3  mm  from  the  retina  without  causing  vitreous 
hemorrhage  and  with  only  minor  cell  disruption 
visible  upon  histological  examination.  Based  in  part 
on  these  results,  the  FDA  has  approved  an  initial 
clinical  trial  of  picosecond  laser  vitreous  membrane 
surgery  in  human  patients.  We  have  also  extended 
the  use  of  picosecond  lasers  to  glaucoma  surgical 
procedures  such  as  iridectomy  and  ab  externo 
sclerostomy. 

In  addition  to  further  clinical  studies  using  our 
current  picosecond  laser  system,  we  are  also  devel¬ 
oping  new  laser  technology  designed  specifically  for 
optimizing  highly  localized  photodisruption  in  trans¬ 
parent  structures  using  laser  induced  breakdown. 
These  studies  are  being  performed  using  a 
flashlamp-pumped  titanium-sapphire  laser  tunable 
from  700-1000  nm  at  20  Hz  repetition  rate. 
Flashlamp  pumped  solid  state  lasers  feature  higher 
pulse  energy  and  lower  cost  compared  to  cw  laser 
Dumped  lasers.  Using  a  combination  of  active 
modelocking  and  passive  optical  limiting  for  sup¬ 
pression  of  relaxation  oscillations,  we  have  obtained 
modelocked  pulses  of  approximately  100 
picoseconds  duration  with  millijoule  energy.  Further 
studies  should  reduce  the  pulse  duration  to  singie 
picoseconds,  using  passive  modelocking  and  new 
techniques  for  controlling  pulse  shaping  and  pulse 
energy  using  nonlinear  intracavity  elements.  These 
new  techniques  employ  refractive  index  nonlinear¬ 
ities  and  intracavity  aperturing  to  achieve  saturable 
absorption  or  saturable  gain.  If  successful,  the 
wavelength  tunability  and  short  pulse  duration  avail¬ 
able  from  this  laser  system  will  outperform  existing 
Nd:YAG  and  Nd:YLF  laser  technology  for  a  wide 
range  of  laser  medical  applications. 

In  addition  to  direct  modelocking  we  are  also  devel¬ 
oping  a  variable  pulse  duration,  regeneratively 
amplified  titanium-sapphire  laser.  The  100  fs,  80 
MHz,  nJ  energy  Kerr-lens-modelocked  oscillator 
described  in  a  previous  section  is  being  incorpo¬ 
rated  as  a  seed  laser  for  chirped-pulse  regenerative 
amplification  in  the  flashlamp-pumped  titatium- 
sapphire  rod.  The  completed  laser  will  have  vari¬ 
able  pulse  duration  (100  fs  -  300  ps),  wavelength 
tunable  (700  -  1000  nm)  output.  Using  this  source, 
studies  will  be  performed  to  correlate  tissue  incision 
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and  collateral  injury  effects  with  laser  parameters 
(pulse  duration,  wavelength,  and  repetition  rate). 
Studies  will  include  time  resolved  measurements  of 
the  fundamental  physical  processes  in  optical 
breakdown,  as  well  as  tissue  effects.  Special 
emphasis  will  be  placed  on  investigating  pulse 
durations  shorter  than  1 00  picoseconds  in  order  to 
reduce  the  pulse  energy  needed  to  achieve  optical 
breakdown  below  the  70  microjoule  level  used  in 
our  previous  study.  This  system  will  serve  as  a  val¬ 
uable  and  unique  tool  for  optimizing  the 
localizability  and  incision  rate  for  uitrashort  pulse 
intraocular  laser  surgery. 


1.15.2  Optical  Coherence  Tomography 

Optical  coherence  tomography  (OCT)  is  a  novel 
non-contact,  non-invasive,  high  resolution  tomo¬ 
graphic  imaging  technology  that  we  have  developed 
for  the  measurement  of  microstructures  in  biological 
tissues.®'  OCT  can  achieve  a  spatial  resolution  of 
10  micrometers,  which  is  more  than  a  factor  of  10 
better  than  current  clinical  tomographic  imaging 
modalities  such  as  CT,  MRI,  or  ultrasound  B  mode 
imaging.  The  operation  of  OCT  is  analogous  to 
ultrasound  imaging.  An  incident  optical  beam  is 
reflected  or  backscattered  from  tissue  structures. 
The  roundtrip  delay  of  the  returned  light  is  meas¬ 
ured  to  derive  the  depth  of  the  reflecting  structure. 
This  delay  measurement  is  carried  out  using  an 
interferometric  ranging  technique  called  optical 
coherence  domain  reflectometry  (OCDR).®^  By 
incorporating  a  lateral  scanning  mechanism  to  a 
high  speed  OCDR  system,  a  two  dimensional 
cross-sectional  map  of  tissue  backscattering  magni¬ 
tude  is  obtained.  This  data  is  presented  in  false- 
color  or  gray  scale  as  an  optical  coherence 
tomograph.  Our  OCT  studies  have  concentrated  on 
the  measurement  of  eye  structures  because  the 
transparency  of  the  ocular  media  permit  easy 
optical  access,  and  because  OCT  has  the  potential 
of  obtaining  clinical  useful  information  in  very  thin 
eye  structures  such  as  the  retina  and  the  cornea 
that  are  not  resolvable  using  other  imaging 
methods. 


This  research  project  spans  several  different  areas 
including  technology  development,  physical  studies, 
and  studies  in  biological  systems  in  vitro  and  in 
vivo.  To  address  these  issues,  studies  are  per¬ 
formed  in  collaboration  with  investigators  at  the 
Optical  Communications  Group  at  MIT  Lincoln  Lab¬ 
oratory,  and  the  New  England  Eye  Center  of  the 
Tufts  University  School  of  Medicine. 

Our  initial  results  on  OCT,  published  in  Science,®® 
demonstrated  the  application  of  OCT  in  ophthal¬ 
mology  by  performing  imaging  of  the  retina  in  vitro. 
The  problem  of  noninvasive  measurement  of  the 
retinal  thickness  and  morphology  is  relevant  to  the 
diagnosis  of  glaucoma  and  other  retinal  diseases. 
In  addition,  as  a  example  of  OCT  imaging  in  a 
highly  scattering  system,  we  have  performed 
studies  in  coronary  arteries  in  vitro. 

During  the  past  year,  we  conducted  our  first  in  vivo 
studies  of  the  retinal  imaging  in  human  subjects. 
These  OCT  images  of  the  retina  and  the  optic 
nerve  head  have  at  least  ten  times  higher  depth 
resolution  than  conventional  ocular  imaging 
methods  such  as  ultrasound,  laser  scanning  tomog¬ 
raphy,  and  x-ray  computed  tomography.  These  in 
vivo  studies  were  made  possible  by  the  develop¬ 
ment  of  a  high  speed  OCT  system  with  image 
acquisition  times  of  several  seconds.  The  OCT 
system  was  coupled  via  a  optical  fiber  and 
galvanometer  driven  transverse  scanning  mech¬ 
anisms  to  a  slitlamp  biomicroscope,  a  commonly 
used  instrument  for  clinical  examination  of  the  eye. 
We  are  continuing  to  improve  the  speed  and 
robustness  of  the  system  in  preparation  for  clinical 
studies  on  glaucoma  and  other  vitreo-retinal  dis¬ 
eases.  We  expect  to  upgrade  the  image  acquisition 
time  to  1-4  seconds  in  the  near  future  to  minimize 
patient  efforts  in  clinical  studies. 

High-resolution  imaging  of  retinal  structures  is  clin¬ 
ically  relevant  to  the  diagnosis  and  management  of 
a  variety  of  retinal  diseases  such  as  glaucoma, 
macular  degeneration,  macular  hole,  and  macular 
edema.  Non-invasive  OCT  imaging  of  retinal  struc¬ 
tures  in  human  subjects  was  demonstrated  this 


5’  D.  Huang.  E.A.  Swanson,  C.P.  Lin,  J.S.  Schuman,  W.G.  Stinson,  W.  Chang.  M.R.  Hoe,  T.  Flotfe,  K.  Gregory,  C.A.  Puliafito  and  J.G. 
Fujimoto,  "Optical  Coherence  Tomography,"  Sci.  254:  1178  (1991). 

52  R.C.  Youngquist,  S.  Carr,  and  D.E.N,  Davies.  "Optical  Coherence-domain  Reflectometry;  a  New  Optical  Evaluation  Technique,"  Opt. 
Lett.  12:  158  (1987):  K.  Takada,  I.  Yokohama,  K.  Chida,  and  J.  Noda,  "New  Measurement  System  for  Fault  Location  in  Optical 
Waveguide  Devices  Based  on  an  Interferometric  Technique,"  Appl.  Opt.  26:  1603  (1987);  D.  Huang,  J.  Wang.  C.P.  Lin,  C.A.  Puliafito 
and  J.G.  Fujimoto,  "Micron-resolution  Ranging  of  Cornea  Anterior  Chamber  by  Optical  Reflectometry."  Lasers  Surg.  Med.  11:  419 
(1991);  E.A.  Swanson,  D.  Huang.  M.R.  Hee.  J.G.  Fujimoto,  C.P.  Lin.  and  C.A.  Puliafito,  "High-speed  Optical  Coherence  Domain 
Reflectometry."  Opt.  Lett.  17:  151  (1992). 

53  D.  Huang,  E.A.  Swanson.  C.P.  Lin,  J.S.  Schuman,  W.G.  Stinson,  W.  Chang,  M.R.  Hoe,  T.  Flotte,  K.  Gregory,  C.A.  Puliafito  and  J.G. 
Fujimoto,  "Optical  Coherence  Tomography,"  Sci.  254: 1178  (1991). 


Chapter  1 .  Optics  and  Quantum  Electronics 


year.  These  tomographs  have  far  higher  resolution 
than  other  non-invasive  imaging  techniques  and,  to 
our  knowledge,  are  the  first  images  that  actually 
allow  one  to  delineate  retinal  layers  in  vivo. 

In  OCT  images  of  the  foveal  region  of  the  retina, 
the  retina  layers  were  identifiable  from  the  previ¬ 
ously  known  anatomy.  The  backscattering  is  very 
weak  from  the  foveolar  region,  where  the  retina 
consists  mainly  of  the  photoreceptor  layer  and  the 
Henie  fiber  layer.  This  contrasts  with  the  sur¬ 
rounding  region,  where  the  brighter  nerve  fiber  layer 
and  the  plexiform  layers  of  the  retina  appear  to 
gradually  increase  in  thickness  further  from  the 
center  of  the  fovea.  Other  posterior  eye  structures 
such  as  the  vitreous  medium,  the  choroid,  and  the 
sclera,  are  also  clearly  identifiable.  The  ability  of 
OCT  to  image  these  retinal  structural  details  dem¬ 
onstrates  its  potential  in  the  diagnosis  and  quantita¬ 
tive  monitoring  of  retinal  diseases  such  as  macular 
degeneration,  macular  hole,  and  macular  edema. 

In  OCT  images  of  the  retina  around  the  optic  nerve 
head,  the  retinal  nerve  fiber  layer  (RNFL),  appear 
as  a  thick,  highly  scattering  layer  in  the  inner  retina 
that  contrasts  with  the  subjacent  retinal  layers. 
Accurate  RNFL  thickness  measurements  are  rele¬ 
vant  to  the  diagnosis  and  management  of 
glaucoma.®^  Glaucoma,  where  elevated  intraocular 
pressure  produces  retinal  nerve  fiber  atrophy,  is  a 
leading  cause  of  blindness.  The  diagnosis  of 
glaucoma  is  a  difficult  clinical  problem.  Intraocular 
pressure  measurements  do  not  reliably  predict 
disease  progression.  Other  diagnostic  parameters 
such  as  visual  field  defects  and  optic  disc  cupping 
are  detectable  only  after  significant  (-  50  %)  loss  in 
the  retinal  nerve  fiber  layer  (RNFL)  has  already 
occured.  Because  both  medical  and  surgical  treat¬ 
ments  for  glaucoma  can  cause  substantial  adverse 
effects,  there  is  a  great  deal  of  uncertainty  and 
dilemma  in  making  early  treatment  decisions.  Early 
detection  of  optic  nerve  fiber  atrophy  with  OCT 
could  greatly  reduce  this  uncertainty.  Further  trials 
in  glaucoma  patients  and  glaucoma  animal  models 
will  determine  whether  OCT  measurements  can 


provide  early  criteria  by  which  to  make  treatment 
decisions  in  glaucoma  patients. 


1.15.3  Transillumination  Imaging 

Optical  transillumination  imaging  of  tissue  offers  the 
potential  of  a  non-invasive  diagnostic  with  non¬ 
ionizing  radiation  and  the  possibility  of  using 
spectroscopic  properties  to  distinguish  tissue 
pathology  and  probe  metabolic  function.  Optical 
transillumination  is  analogous  to  x-ray  imaging; 
however,  optical  radiation  is  relatively  safe  and  can 
provide  tissue  contrast  based  on  spectroscopic  dif¬ 
ferences.  Unfortunately,  absorption  and  multiple 
light  scattering  severely  limit  optical  image  resol¬ 
ution  in  dense  or  thick  tissue  specimens.  Recently, 
there  has  been  interest  in  transillumination  as  an 
alternative  or  supplement  to  mammography  for 
early  breast  tumor  detection.  Optical  radiation  may 
be  capable  of  distinguishing  malignant  tissue 
without  the  hazards  of  x-ray  imaging  based  on  the 
optical  properties  of  the  surrounding  tumor  neovas¬ 
cularization. 

Several  time  or  spatially  resolved  optical  imaging 
techniques  have  been  proposed  which  reduce 
image  degradation  due  to  scattering  by  discrimi¬ 
nating  against  the  spatial  or  temporal  characteristics 
of  scattered  light.  Spatially  resolved  methods, 
based  on  the  confocal  imaging  principle,  require 
detected  light  to  have  a  certain  directionality  or 
spatial  phase  coherence.®*  Time  resolved  tech¬ 
niques  rely  on  the  fact  that  multiply  scattered  light 
travels  a  longer  path  through  tissue  compared  to 
unscattered  light,  and  therefore  may  be  rejected 
temporally.  A  short  pulse  propagating  through  a 
scattering  media  will  be  severely  lengthened  due  to 
scattering,  but  a  shadowgram  of  hidden  objects 
may  be  reconstructed  by  temporally  isolating  the 
earliest  arriving,  least  scattered  component  of  the 
transmitted  light  from  the  later-arriving  diffuse,  or 
multiply  scattered  light.  Any  of  a  number  of  dif¬ 
ferent  incoherent  or  coherent  imaging  techniques 
may  be  used  to  time-gate  early  arriving  light, 
including  streak  camera  imaging,®®  photon 
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counting, nonlinear  gating,®®  and  time-resolved 
interferometry®®  or  holography.®® 

In  collaboration  with  MIT  Lincoln  Laboratory,  we 
have  developed  a  new  technique  for  time-gated 
imaging  through  turbid  media  based  on  optical 
coherence  tomography  (OCT).®^  In  femtosecond 
transillumination  optical  coherence  tomography, 
femtosecond  pulses  and  coherent  heterodyne 
detection  time-resolve  transmitted  light  based  on  its 
coherence  properties.®^  The  operation  principle  of 
transillumination  OCT  is  analogous  to  reflection¬ 
mode  OCT;  however,  instead  of  depth  resolving 
reflected  light,  the  coherence  properties  of  trans¬ 
mitted  light  are  used  to  transversely  resolve  objects 
otherwise  obscured  by  multiple  scattering.  Short 
pulses  (50  to  400  fs)  from  a  Kerr-lens  modelocked 
TirAbOs  laser  are  used  in  conjunction  with  a  modi¬ 
fied  fiber-optic  Mach-Zehnder  interferometer.  A 
fiber  beamsplitter  divides  the  incident  pulses  into  a 
reference  delay  path  and  a  sample  transmission 
path.  Light  retroreflected  from  the  variable  delay 
reference  mirror  is  recombined  with  temporally 
broadened  light  transmitted  through  the  sample  at 
another  fiber  splitter  and  is  incident  on  a 
photodetector.  Interference  signal  is  created  at  the 
detector  only  for  the  coherent  component  of  the 
transmitted  pulse  which  temporally  overlaps  the  ref¬ 
erence  pulse.  Using  dual  balanced  detectors  to 
reduce  excess  laser  noise  in  conjunction  with 
lock-in  amplification,  we  achieve  the  quantum  shot 
noise  detection  limit.  The  transillumination  system 
can  resolve  transmitted  light  as  weak  as  5 
femtowatts,  which  is  10-'®  of  the  incident  optical 
power  of  50  mW  (a  130  dB  dynamic  range). 


To  study  the  basic  characteristics  of  coherent 
photon  migration  through  scattering  media,  we  have 
used  transillumination  OCT  to  temporally  profile  the 
coherent  constituent  of  the  on-axis  scattered  pulse 
propagating  through  suspensions  of  uniform  scat¬ 
tering  microspheres.  Similar  to  earlier  studies  of 
incoherent  migration,®®  the  coherent  transmitted 
light  consists  of  an  unscattered  ballistic  component, 
and  a  later  arriving,  temporally  broadened  diffuse 
component.  The  arrival  time  and  temporal  profile  of 
these  components  have  been  investigated  as  func¬ 
tions  of  scattering  anisotropy  and  microsphere  con¬ 
centration,  and  have  been  qualitatively  correlated 
with  predictions  from  Mie  scattering  theory.  Com¬ 
pared  to  previous  studies  of  incoherent  photon 
migration,  the  coherent  diffuse  component  displays 
a  reduced  temporal  extent  due  to  loss  of  phase 
coherence  with  multiple  scattering.  The  ballistic 
peak  power  attenuates  exponentially  as  the  number 
of  scattering  mean-free-paths  (MFPs)  contained  in 
the  scattering  medium,  or  equivalent  sample  thick¬ 
ness.  The  transmitted  diffuse  power  dominates 
ballistic  light  for  sample  thicker  than  about  25  scat¬ 
tering  MFPs,  fairly  independent  of  scattering 
anisotropy.  The  diffuse  power  also  attenuates 
about  a  factor  of  1 0  more  slowly  in  the  exponential 
with  increasing  sample  thickness. 

Time-gated  imaging  of  hidden  objects  embedded  in 
scattering  media  is  achieved  by  fixing  the  reference 
arm  path  length  and  raster  scanning  the  sample  to 
create  a  two-dimensional  image  of  interference 
signal  magnitude.  High  resolution  (100  ^m)  images 
of  an  Air  Force  resolution  chart  placed  in  the  center 
of  a  1  pm  diameter  microsphere  suspension  27 
scattering  MFPs  thick  have  been  obtained  by  using 
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transillumination  OCT  to  coherence-gate  only  the 
early  arriving  ballistic  component  of  the  transmitted 
light.  Since  ballistic  light  is  unscattered,  the  spatial 
resolution  of  ballistic  gated  imaging  is  theoretically 
diffraction  limited.  However,  imaging  with  ballistic 
light  is  virtually  impossible  through  thick  specimens 
due  to  the  exponential  attenuation  of  the  ballistic 
component  with  sample  thickness.  We  have 
derived  a  fundamental  ballistic  imaging  thickness 
limit  for  any  optical  imaging  technique  based  on 
quantum  noise  considerations  and  optical  tissue 
damage  thresholds.  For  parameters  required  for 
early  breast  tumor  detection,  this  thickness  limit 
evaluates  to  approximately  4  mm  breast  tissue. 

The  combined  temporal  and  directional  rejection  of 
scattered  light  with  transiilumination  OCT  allows  a 
comparison  of  time  and  spatially  resolved  imaging 
techniques.  A  purely  spatial  discrimination  tech¬ 
nique,  such  as  confocal  imaging,  will  be  sensitive  to 
the  transiilumination  OCT  interference  signal 
integrated  over  all  coherence-gate  delays.  Studies 
of  time-resolved  photon  migration  with  confocal 
imaging  show  that  ballistic  imaging  is  impossible  for 
spatially  resolved  methods  because  a  substantial 
amount  of  late  arriving  diffuse  light  may  be  coherent 
at  the  detector  when  the  sample  is  thicker  than 
about  20  scattering  MFPs. 

Since  ballistic  imaging  is  restricted  to  relatively  thin 
samples,  medically  relevant  applications  require 
gating  the  early  arriving  portion  of  the  diffuse  trans¬ 
mitted  light.  Spatial  resolution  is  sacrificed  for  the 
wider  coherence-gate  delays  necessary  to  image 
through  thick  samples.  We  characterized  the 
degradation  in  resolution  versus  coherent  photon 
arrival  time  for  various  sized  opaque  bars  placed  in 
25  scattering  MFPs  of  1  urn  diameter  microspheres. 
The  spatial  resolution  for  a  given  transit  delay  was 
found  to  be  bounded  above  by  a  simple  square-root 
limit,  obtained  from  geometric  considerations,  and 
below  by  an  empirically  derived  linear  limit.  Consid¬ 
eration  of  these  limits  enabled  us  to  design  a  trans¬ 
illumination  OCT  system  capable  of  resolving  a  1 
mm  width  resolution  chart  embedded  in  1.5  cm 
chicken  breast  in  vitro.  To  our  knowledge,  this 
image  represents  the  thickest  biological  sample  with 
similar  scattering  characteristics  through  which 
submillimeter  image  resolution  has  been  obtained. 

The  fundamental  resolution  and  thickness  limits  we 
have  identified  in  these  basic  studies  should  prove 
useful  in  the  design  of  future  time-domain  imaging 
techniques  for  biomedical  applications. 


1.16  Observation  of  Gain  in  Ni-like  Nb 
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During  the  past  four  years,  we  have  assembled  an 
experimental  facility  suitable  for  testing  various 
novel  and  low  power  approaches  to  EUV  and  soft 
x-ray  lasers.®^  Our  facility  consists  of  a  Nd:YLF 
oscillator  and  Nd:glass  pre-amplifier  (currently  being 
upgraded  to  a  Nd.YLF  pre-amplifier),  a  zig-zag  slab 
power  amplifier,  a  200-liter  vacuum  chamber  with 
an  internal  target  positioning  and  alignment  system, 
and  two  EUV  spectrometers.  Various  components 
of  our  facility  have  been  described  in  previous 
annual  RLE  Progress  Reports. 

Last  spring  we  began  taking  data,  searching  for  the 
laser  line  in  Ni-like  Mo.  The  principal  diagnostic  for 
this  work  is  a  nearly  stigmatic  spectrometer  which  is 
based  on  a  near-normal  incidence  spherical  grating 
which  images  onto  the  cathode  of  a  streak  camera 
as  illustrated  below  in  figure  1.  The  grating  was 
loaned  by  the  National  Research  Laboratory  (NRL); 
the  streak  camera  was  loaned  by  Lawrence 
Livermore  National  Laboratories. 

The  principal  laser  lines  in  mid-Z  Ni-like  ions  have 
not  been  previously  identified,  and  as  a  result  we 
have  relied  on  theory  to  a  significant  degree  to 
guide  our  search.  The  computation  of  accurate  line 
positions  for  the  3d®4d ’S  -  3d®4p ’P  transition  is 
hindered  by  strong  correlation  effects  due  to  mixing 
between  the  upper  laser  state  and  the  4d'°'S 
ground  state.  For  example,  the  results  of  a  multi¬ 
configuration  Hartree-Fock  calculation  gives  the 
wavelength  of  the  line  in  Ni-like  Mo  to  be  near  172 
A;  the  correlation  corrections  bring  the  wavelength 
near  190  A,  a  correction  of  about  ten  percent.  We 
first  corrected  the  wavelength  using  data  from  Eu 
obtained  by  Lawrence  Livermore  National  Laborato¬ 
ries,  which  led  to  an  interpolated  wavelength  of  194 
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A.  This  value  was  updated  using  a  low-Z  measure¬ 
ment  in  As  to  provide  a  further  correction  to  191  A. 
In  the  spring  of  1992,  a  measurement  of  the  analog 
line  in  Ni-like  Sn  was  reported  by  the  NRL  at  119.1 
A;  we  were  able  to  use  this  value  to  improve  our 
prediction  to  188.8  A  for  Mo.  When  we  finally 
found  the  laser  line  in  Ni-like  Mo,  it  appeared  at 
189.2  A,  as  shown  in  figure  2. 


The  efficiency  of  the  NRL  grating  becomes  quite 
low  at  wavelengths  below  200  A,  and  this  poor  effi¬ 
ciency  hindered  our  experiments;  many  shots  would 
be  required  to  find  the  nearby  strong  Cu-like  lines, 
and  the  laser  line  would  appear  after  only  the  most 
heroic  efforts.  As  a  result,  we  decided  to  shift  the 
primary  focus  of  our  experiments  to  a  study  of  gain 
in  Ni-like  Nb,  which  was  predicted  to  occur  at 
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Figure  1 .  Schematic  of  the  x-ray  laser  target  chamber  and  spectrometers. 
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Figure  2.  Spectrum  of  Mo  in  the  vicinity  of  the  Ni-like  Mo  3d’4d  'S  -  3d’4p  ’P  laser  line. 
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203.7  A.  The  transition  in  Nb  was  readily  found, 
and  it  occured  at  a  wavelength  of  204.2  A. 

The  principal  laser  lines  in  Ni-like  ions  are  quite 
weak  if  no  gain  is  present.  Since  we  observed  the 
lines,  and  since  we  were  able  to  make  the  candi¬ 
date  laser  line  dominate  the  spectrum  locally,  we 
knew  that  we  had  produced  significant  gain.  This 
fact  is  important,  because  previously  gain  in  Ni-like 
ions  had  only  been  produced  at  larger  laser  facili¬ 
ties  in  experiments  involving  kilojoules  of  incident 
laser  pump  energy;  in  our  experiments  we  used 
less  than  1  joule  per  pump  pulse.  Our  result  consti¬ 
tuted  a  rather  major  breakthrough  in  the  area  of 
collisionally-driven  EUV  and  soft  x-ray  lasers. 

It  remained  to  quantify  the  small  signal  laser  gain. 
Experiments  were  conducted  in  which  the  beam 
was  blocked  to  produce  various  plasma  amplifier 
lengths,  and  the  resulting  measured  intensity  was 
plotted  as  a  function  of  the  amplifier  length.  The 
resulting  curve  shows  exponentiation  characteristic 
of  laser  gain,  with  a  corresponding  gain  coefficient 
of  about  3  cm-’,  as  shown  in  figure  3.  To  date,  out 
best  data  has  been  taken  at  a  length  of  9.1  mm, 
using  about  1  joule  per  pulse  of  pump  energy.  The 
resulting  spectra,  shown  in  figure  4,  is  characteristic 
of  between  3.0-3.5  total  gain-lengths. 
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Figure  3.  Intensity  as  a  function  of  length  for  the  Ni-like 

Nb  3d®4d  'S  -  3d»4p  ’P  laser  line. 
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Figure  4.  The  3d*4d  'S  -  3d*4p  ’P  laser  line  (204.3  A)  in 
Ni-like  Nb  at  9.1  mm. 
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1.17  Pump  Pulse  Length 
Project  staff 

James  G.  Goodberlet,  Professor  Peter  L. 
Hagelstein 

We  lost  a  Nd:glass  preamplifier  rod  due  to  optical 
damage  in  November,  and  this  loss  has  caused 
downtime.  We  took  the  opportunity  to  upgrade  the 
preamplifier  to  a  Nd:YLF  rod,  but  before  coming 
back  up  to  run  integral  EUV  laser  experiments,  we 
wanted  to  check  the  pulselength  to  make  certain 
that  the  oscillator  was  not  producing  short  pulses 
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spuriously.  According  to  specifications,  the  Lum- 
onics  oscillator  is  supposed  to  provide  70  psec 
pulses;  significantly  shorter  pulses  at  full  energy 
could  potentially  damage  the  system. 

To  investigate  this  issue,  autocorrelation  of  the 
electric  field  was  accomplished  using  a  Michaelson 
interferometer  with  tilted  mirrors  to  produce 
noncollinear  interfering  beams  at  the  detector.  If 
the  pulses  arrive  at  the  detector  coincidently,  they 
produce  an  intereference  pattern;  additional  path 
length  in  one  arm  produces  a  delay  which  can 
reduce  or  destroy  the  interference. 

The  initial  results  of  the  interferometry  indicated  that 
some  of  the  pulses  produced  by  the  oscillator  were 
as  short  as  25  psec.  A  weakness  of  an 
autocorrelator  based  on  field  rather  than  intensity  is 
that  it  does  not  distinguish  between  a  short  pulse 
and  a  chirped  pulse;  inspection  of  the  second  har¬ 
monic  signal  from  a  KDP  crystal  indicated  that 
indeed  short  pulses  were  present. 

The  large  variation  in  pulse  length  was  found  to  be 
due  to  an  absence  of  prelase  signal  before  the 
oscillator  Q-switched;  the  Lumonics  laser  was  ori¬ 
ginally  shipped  and  brought  up  at  MIT  with  Lum¬ 
onics  staff  to  operate  without  prelasing.  We  reset 
the  oscillator  to  prelase  before  Q-switching  which 
stabilized  the  system. 

Intensity  autocorrelation  was  performed  on  the 
oscillator  output,  and  the  resulting  pulses  were 
determined  to  be  near  60  psec;  an  etalon  was 
acquired  which  results  in  120  psec  pulses. 

In  light  of  these  results,  some  of  the  experimental 
difficulties  which  had  been  noted  were  likely  due  to 
the  substantial  pulse  variability  that  was  present  in 
the  system.  A  question  now  arises  whether  the 
best  data  that  we  obtained  which  showed  gain 
occured  because  the  pump  pulse  was  short,  fol¬ 
lowing  from  the  increased  intensity  present.  If  so, 
this  will  limit  how  much  more  EUV  laser  total  gain 
we  will  be  able  to  obtain  in  the  future.  The  depend¬ 
ence  of  gain  on  pulse  length  will  be  the  focus  of 
experiments  to  take  place  in  the  coming  months. 


1.18  End-pumped  X-ray  Lasers 

Sponsors 

Bose  Corporation 

Lawrence  Livermore  National  Laboratories 
Contract  B1 60530 
U.S.  Department  of  Energy 
Grant  DE-FG02-89-ER14012 


Project  Staff 

Professor  Peter  L.  Hagelstein,  Dr.  Santanu  Basu, 
Martin  Muendel 

Our  research  has  recently  led  to  a  novel  and  very 
efficient  EUV  amplifier  which  is  able  to  obtain  1 
gain-length  at  204  A  per  0.3  joules  of  incident  1n 
pump  energy;  previous  experimental  work  using 
collisional  excitation  schemes  in  this  wavelength 
range  has  involved  relative  pumping  efficiencies 
measured  in  gain-lengths  per  kilojoule. 

The  improvements  which  we  have  obtained  are  due 
primarily  to:  (1)  the  use  of  Ni-like  ions,  which  has 
not  previously  been  explored  in  this  wavelength 
range;  and  (2)  the  use  of  a  pulse  train  to  create  and 
then  pump  the  plasma,  which  results  in  a  very  low 
ion  temperature  with  an  associated  high  gain  due  to 
a  reduction  in  Doppler  broadening. 

The  question  arises  as  to  whether  further  improve¬ 
ments  might  lead  to  still  more  gains  in  efficiency. 
For  example,  most  of  the  laser  energy  which  is 
absorbed  goes  ultimately  into  hydrodynamic  kinetic 
energy  or  simply  heats  high  density  material  far 
away  from  where  gain  is  produced.  Only  about  one 
part  in  10^  of  the  absorbed  pump  energy  is  ulti¬ 
mately  utilized  for  the  production  of  gain. 

We  have  considered  possible  approaches  towards 
improving  the  efficiency  of  EUV  and  soft  x-ray  laser 
amplifiers.  If  a  pre-expanded  low  density  (around 
10’®  electrons/cm®)  plasma  amplifier  could  be  pro¬ 
duced,  then  laser  energy  absorbed  at  low  density 
would  be  used  more  efficiently  to  produce  gain  (the 
conversion  efficiency  of  laser  light  to  hydrodynamic 
energy  in  such  a  plasma  is  very  poor).  But  such  a 
scheme  is  ill-suited  to  current  x-ray  laser  pumping 
arrangements  since  the  absorption  coefficient  of  a 
low  density  plasma  is  very  low;  less  than  one 
percent  of  the  pump  radiation  would  be  absorbed  in 
a  100  wide  low  density  plasma  pumped  from  the 
side  under  conditions  where  gain  would  be 
expected. 

A  potential  solution  to  this  problem  is  to  modify  the 
pumping  geometry  and  to  pump  from  the  end 
instead  of  the  side.  Ideally,  the  pump  radiation 
would  in  this  case  be  absorbed  where  it  does  the 
most  good,  and  the  EUV  beam  builds  up  as  it 
copropagates  with  the  optical  beam.  The  total 
number  of  gain  lengths  produced  before  the  beam 
is  completely  absorbed  will  determine  whether  the 
approach  is  a  good  one. 

The  effectiveness  of  this  approach  can  be  seen  by 
comparing  the  total  number  of  EUV  gain  lengths  gL 
with  the  number  of  inverse  Bremsstrahlung  absorp¬ 
tion  lengths  xibL.  Interestingly  enough,  this  ratio  is 
maximized  at  low  density.  Normally,  x-ray  laser 
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designers  seek  to  maxirize  the  small  signal  gain  in 
a  laser  design;  in  tivs  case  the  total  number  of 
gain-lengths  is  ma'  .nized  at  a  small  signal  gain 
which  is  less  .  ,etn  the  maximum  possible  small 
signal  gain. 

At  an  electron  density  which  is  low  compared  to  the 
critical  density  (Ne-^Nc),  and  also  low  compared  to 
the  characteristic  density  of  the  laser  transition 
(NexAu/ <  ov  >u),  the  ratio  of  the  EUV  gain  to  IR 
absorption  becomes  essentially  independent  of 
electron  density.  It  can  be  shown  that  the  low 
density  limit  is  given  by 

gL  Nq  <  ov  4  /  n  ^  (mkTg) 

K,bL“  SE  A,  3V2  z*e®lnAx2  ' 

In  this  formula,  ose  is  the  stimulated  emission  cross 
section.  No  is  the  ground  state  3d'®  'S  population, 
<ov>oo  is  the  rate  coefficient  for  excitations  from 
the  ground  state  to  the  upper  state,  Z*  is  the 
average  ionization  charge  assuming  a  single 
element  plasma,  and  (1  -  ^)  is  the  ratio  of  the  inver¬ 
sion  density  to  the  upper  laser  state  density  at  low 
electron  density.  In  the  case  of  a  low  density 
Ni-like  Mo  EUV  amplifier,  this  ratio  can  be  larger 
than  100. 

The  implication  of  this  result  is  that  it  should  be 
possible  in  principle  to  obtain  further  very  large 
improvements  in  the  efficiency  of  EUV  and  soft 
x-ray  lasers  through  the  successful  development  of 
end-pumped  '“sers. 

A  primary  technical  drawback  of  such  pumping 
schemes  is  that  the  pump  laser  will  either  refract  or 
diffract  out  of  the  plasma  amplifier  before  any  signif¬ 
icant  fraction  of  the  beam  energy  is  absorbed  in  a 
real  system  unless  some  way  can  be  found  to  keep 
the  pump  beam  in  the  plasma.  It  is  natural  to  con¬ 
sider  a  "soda-straw"  type  geometry:  unfortunately, 
such  a  system  by  necessity  brings  high  density 
plasma  and  the  associated  hydrodynamic  losses  in 
close  proximity  to  the  plasma  amplifier. 

We  have  therefore  considered  pumping  schemes 
that  involve  segmented  amplifiers  wherein  external 
optical  components  are  used  to  help  refocus  the 
beam  following  transmission  through  individual  seg¬ 
ments.  Examples  of  such  schemes  are  shown  in 
figures  5-7. 

Publication 

Hagelstein,  P.L.,  S.  Basu,  and  M.  Muendel.  "XRL 
Research  at  MIT:  Recent  Progress."  Pro¬ 
ceedings  of  the  Third  International  Conference 
on  X-ray  Lasers,  Schliersee,  Germany,  May 
1992. 


Figure  5.  Schematic  of  a  simple  stable  end-pumped 
lasei . 


Figure  6.  Schematic  of  a  zig-zag  end-pumped  laser. 


Figure  7.  Schematic  of  solid  surface  targets  pumped  in 
sequence. 


1.19  The  Kinetics  and  Hydrodynamics 
of  Laser-Generated  Plasmas 

Sponsor 

MIT  Lincoln  Laboratory 

Project  Staff 

Professor  Peter  L.  Hagelstein,  Ann  W. 
Morgenthaler,  Janet  L.  Pan 

Laser-produced  plasmas  which  are  ablated  from  the 
surface  of  a  solid  are  currently  being  used  for  x-ray 
laser  amplifiers  in  a  number  of  laboratories  around 
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the  world;  our  group  at  MIT  has  been  focusing  on 
the  use  of  such  plasmas  primarily  for  collisional 
(Ni-like)  x-ray  lasers  as  described  elsewhere  in  'his 
report. 

The  success  of  the  collisional  scheme  depends 
strongly  on  achieving  a  high  electron  temperature  in 
the  low-density  coronal  region  of  the  plasma.  The 
laser  light  is  absorbed  primarily  near  the  critical 
surface  (at  10^'  electrons/cm^  for  1  pm  pump  radi¬ 
ation),  where  the  electron  temperature  is  deter¬ 
mined  by  a  balance  between  inverse 
Bremsstrahlung  absorption  and  flux-limited  thermal 
conduction  in  other  parts  of  the  plasma.  The 
thermal  conductivity  is  a  strongly  increasing  function 
of  the  electron  temperature,  and  if  is  expected  that 
once  a  high  temperature  is  established  near  the 
critical  surface,  heating  of  the  low  density  corona  by 
conduction  will  quickly  follow. 

A  basic  understanding  of  electron  the'mal  con¬ 
duction  in  the  plasma  is  therefore  critical  for  making 
a  quantitative  prediction  of  the  electron  temperature 
as  a  function  of  the  pump  laser  intensity,  especially 
in  regimes  where  significant  flux  limiting  occurs. 
This  has  motivated  our  interest  in  the  basic  physics 
of  electron  transport  in  laser-produced  plasmas. 
The  approach  which  we  have  explored  involves 
taking  moments  of  the  full,  collisional  Boltzmann 
equation,  with  Coulombic  collisions  described  by 
the  Landau  operator  as  outlined  in  last  year's  RLE 
Progress  Report.  The  treatment  of  collisions  in  the 
moment  method  is  technically  complicated,  and  we 
have  recently  reported  a  rather  general  solution  to 
this  problem  that  is  amenable  to  numerical  mod¬ 
eling.®^  Exact  analytic  coupling  coefficients 
describing  the  interaction  between  identical  particles 
during  a  collision  are  a  central  result  of  this  paper, 
and  this  work  has  been  extended  to  describe  arbi¬ 
trary  collisions  between  unlike  particles  (e.g., 
electron-ion  collisions)  which  are  critical  for 
describing  plasma  interactions. 

The  end  result  of  the  moment  method  is  a  set  of 
coupled  equations  describing  average  plasma 
quantities  such  as  density,  mean  velocity,  temper¬ 
ature,  and  heat  flux.  The  simplest  useful  set  of 
such  coupled  equations  are  those  describing  elec¬ 
trons  and  ions  in  a  1  1/2  dimensional  model  (1-D 
space  and  2-D  velocity).  The  time-evolution  equa¬ 
tions  for  the  lowest  order  moments  are  mass  conti¬ 
nuity  equations; 


(1) 

|-N,.4-(N,v,).0 

(2) 

where  and  N,  are  electron  and  ion  densities,  and 
Ve  and  V,  are  average  z-directed  electron  and  ion 
velocities.  The  first  order  moments  give  rise  to 
momentum  equations 
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Here,  Te  and  T,  are  electron  and  ion  temperatures, 
m  and  M  are  electron  an  ion  masses,  Z  is  the 
average  ionic  charge  of  the  plasma,  and  In  a  is  the 
electron-ion  Coulomb  logarithm.  A  final  set  of  equa¬ 
tions  describes  the  temperature  evolution  of  this 
plasma: 
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where  the  linearized  thermal  conductivity  k  can  be 
derived  to  arbitrary  accuracy  from  the  higher-order 
moment  equations  for  this  simplest  plasma.  The 
heat  conduction  term  in  (5)  is  the  place  where  the 
higher  order  moments  couple  to  the  temperature 
equation,  and  by  considering  sufficient  numbers  of 
higher-order  moments,  thermal  flux  limitation  should 
appear  quite  directly. 

It  appears  that  the  plasma  hydrodynamic  equations 
have  many  similarities  with  the  equations  governing 
drift  and  diffusion  in  semi-conductors:  therefore,  we 
will  attempt  to  numerically  solve  the  moment  equa¬ 
tions  with  efficient  techniques  such  as  Gummel's 
method.  The  equations  governing  electron  and  ion 
motion  are  not  typically  solved  in  this  coupled  form, 
and  our  goal  is  to  produce  numerical  models  of  a 
laser-generated  plasma  which  may  be  useful  design 
tools  for  further  developmenis  of  the  x-ray  laser. 
We  are  also  interested  in  quaoi-analytic  models  of 
plasma  heating  by  intense  electric  fields,  which  can 
be  found  by  considering  fast  and  slow  variations  of 
all  field  Quantities.  A  simple  model  describing  the 
coupling  of  electromagnetic  fields  into  the  plasma  is 
currently  under  investigation. 

Because  the  "machinery"  developed  for  under¬ 
standing  laser  plasma  kinetics  is  sufficiently 
general,  we  should  be  able  to  apply  our  techniques 
to  other  outstanding  problems  in  plasma  physics. 


1.20  An  Analytical  Solution  of  the  2D 
Exciton*Phonon  Matrix  Element 

Project  Staff 

Sumanth  Kaushik,  Professor  Peter  L.  Hagelstein 

Work  has  been  on  the  development  of  analytical 
and  numerical  methods  for  the  ab  initio  absorption 
lineshapes  in  two  dimensional  systems.  We  have 
focused  our  attention  of  multiple  quantum  well 
structures  (MOW)  for  which  considerable  amount  of 
experimental  data  exist  in  the  literature.  The  in¬ 
plane  confinement  of  the  electron  and  hole  due  to 
the  quantum  wells  enable  these  devices,  even  at 
room  temperatures,  to  exhibit  pronounced  excitonic 
effects  which  can  be  strongly  modified  by  optical 
and  electro-optical  modulation. 

An  important  parameter  that  characterizes  the 
strength  of  excitonic  effects  is  the  absorption  line- 
width.  The  sharper  the  linewidth,  the  more  pro¬ 
nounced  are  the  excitonic  effects.  At  room 
temperatures,  the  dominant  contribution  to  the  line- 
width  is  the  interaction  of  the  quasi  two-dimensional 
excitons  with  optical  phonons.  Various  theoretical 
models  exist  in  the  field  to  calculate  the  phonon 


contribution  to  the  absorption  linewidth.  However,  in 
all  these  models,  a  fundamental  quantity  needed  in 
the  computation  is  the  photon/phonon  matrix 
element 

<K  m  I  exp(iq  •  r)  1 0,0>  ~  I 

between  the  ground  state  and  an  arbitrary  excited 
state  of  the  exciton. 

Often,  for  convenience,  the  exciton  is  modeled  as  a 
two-dimensional  hydrogen  atom  where  the  para¬ 
meters  such  as  the  Bohr  radius  and  the  binding 
energy  are  adjusted  so  as  to  reproduce  more  exact 
calculations.  In  the  limit  where  the  initial  and  final 
states  are  2D  hydrogen  states,  the  above  matrix 
element  can  be  evaluated  analytically.  However,  in 
evaluating  this  matrix  element,  one  encounters  an 
integral  transform  of  the  form 


I  ■  x(bx)''  exp(  -  px)i  F.,  (a,2v  +  1  ,bx)J^(Yx)dx 

■'  0 

where  a,b,p  are  complex  and  v,  y  are  real.  In  the 
particular  case  of  the  matrix  element  the  para¬ 
meters  are  defined  to  be  b=1/2X,  v  -  |m|  and 
p=1  +  1/4X.  with  m  being  an  integer  and  k  being 
either  real  (for  bound-bound  transitions)  or  complex 
(for  bound-free  transitions). 

We  have  managed  to  formulate  an  analytical  sol¬ 
ution  to  this  matrix  element  which  is  written  in  the 
form 

l-(^)r(2v  +  1)^^j|-^"''-'x 

|^(2v  +  1  -  2a)5P:;.v(p)  -  a(2  - 1-  jP^a^v-  i(p)] 

where  the  a,b,p  are  arbitrary  and  complex  and  v,  y 
are  arbitrary  and  real  and  the  parameters.  This 
matrix  element,  in  addition  to  enabling  rapid  compu¬ 
tation  of  lineshape,  enables  one  to  develop  very 
useful  and  simple  scaling  relations  for  the  linewidth 
in  term  of  phonon  coupling  and  well  parameters. 
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1.21  Quantum  Dot  Diode  Lasers 
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Janet  L.  Pan 

Lasers  in  the  thermal  infrared  (2  to  10  micrometer) 
wavelength  regime  are  useful  for  spectroscopy, 
medicine,  and  fiber  optics.  For  medicinal  purposes, 
these  lasers  take  advantage  of  the  large  absorption 
of  water  in  various  windows  in  the  thermal  infrared. 
For  spectroscopy,  these  lasers  are  useful  because 
many  substances  (such  as  organic  molecules  or 
pollutants  in  the  air)  have  characteristic  vibrational 
frequencies  in  the  thermal  infrared. 

We  are  interested  in  quantum  dots  for  improving 
semiconductor  lasers  in  the  thermal  infrared.  In  this 
wavelength  regime,  the  Auger  rate  limits  the  perfor¬ 
mance  of  conventional  semiconductor  lasers,*®  such 
as  the  lead  salt  semiconductor  lasers,  to  low  tem¬ 
peratures.  Quantum  dots,  with  their  discrete  energy 
levels,  could  greatly  reduce  the  Auger  rate.  We 
have  calculated  interband  Auger  processes  in 
quantum  dots,®^  and  have  shown  that  the  use  of 
large  potential  barriers  surrounding  quantum  dots 
could  greatly  reduce  the  Auger  rates  in  quantum 
dot  lasers  over  those  in  conventional  semiconductor 
lasers. 


Figure  8  shows  the  temperature  dependence  of  the 
CCCV  and  CVVV  intraband  Auger  lifetimes  for  a 
150  A  radius  InSb  quantum  dot  (with  a  bandgap  at 
4.8  micrometers  (258  meV))  surrounded  by  CdTe 
barriers.  We  found  that  enough  Auger  processes 
were  excluded  by  the  large  CdTe  potential  barriers 
so  that  our  InSb  quantum  dot  room  temperature 
Auger  lifetime  was  about  135  nanoseconds.  This  is 
to  be  compared  with  the  calculated  Auger  rates  for 
other  semiconductors  with  band  gaps  in  the 
infrared.  For  GaSb  with  a  bandgap  of  670  meV 
(1.8  micrometers),  the  Auger  lifetime  at  77  K  for  a 


carrier  concentration  in  the  bulk  of 


calculated®®  to  be  about  0.01  nanoseconds  and 
measured®®  to  be  about  1  nanosecond  for  our 

volume  of  -|-n(150A)®.  A  4.5  nanosecond  Auger 

w 

lifetime®®  was  found  at  liquid  nitrogen  temperature 
for  a  conventional  10  urn  PbosaSnceTe  laser, 
whose  active  re#bn  injected  carrier  concentration  is 

0.8  X  10'^cm-®  or  our  volume  of  -^n(150A)®.  Room 

temperature  bulk  InSb  seems  to  have  calculated 
Auger  lifetimes  between  0.1  and  1  nanosecond, 
depending^®  on  how  the  atomic  orbitals  and 
nonparabolic  energy  bands  are  calculated. 

We  are  now  studying  processes  affecting  the 
dynamics  of  bound  carriers  in  quantum  dots.  We 
have  estimated  rates  of  collisions  of  quantum  dot 
carriers  via  the  Coulomb  interaction  amd  compared 
these  rates  to  the  competing  interband  and  intra¬ 
band  radiative  rates  and  phonon 
absorption/emission  rates  in  quantum  dots. 


Figure  8.  The  temperature  dependence  of  the  CCCV 
and  CVW  Auger  lifetimes  for  a  150  A  radius  InSb 
quantum  dot  surrounded  by  CdTe  barriers.  This  is  to  be 
compared  with  the  calculated  Auger  rates  for  other  semi¬ 
conductors  with  band  gaps  in  the  infrared. 
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We  are  interested  in  materials  and  fabrication 
methods  necessary  for  realistic  devices.  We  have 
suggested  various  possibilities,  such  as  InSb  dots 
in  CdTe  barriers,  in  a  recent  paper.^’  Our  long  term 
goal  is  to  fabricate  and  experimentally  study 
quantum  dots  for  use  in  lasers  in  the  thermal 
infrared. 


1.22  Coherent  Fusion  Theory 

Project  Staff 

Professor  Peter  L.  Hageistein,  Irfan  U.  Chaudhary, 
Akikazu  Hashimoto,  Sumanth  Kaushik 

Three  and  a  half  years  have  now  passed  since  the 
announcement  of  the  observation  of  anomalies  in 
deuterated  metals  that  was  made  in  March  1989. 
Excess  heat  was  reported  by  Pons  and 
Fleischmann,  and  evidence  for  low  level  neutron 
emission  was  presented  by  Jones.  Efforts  to  repro¬ 
duce  the  effects  were  largely  unsuccessful  in  1989, 
and  the  scientific  community  has  generally  rejected 
these  claims. 

Pons  and  Fleischmann  have  continued  their  exper¬ 
iments,  and  their  current  claims  are  far  more  dra¬ 
matic  than  those  made  in  1989.  The  initial  1989 
reports  described  a  relatively  weak  heat  effect  that 
was  observed  with  relatively  poor  reproducibility; 
the  claims  of  1992  describe  a  very  dominant  effect 
that  is  completely  reproducible,  although  still  not 
understood. 

At  the  Thirrd  International  Cold  Fusion  Conference 
in  Nagoya,  Japan,  Pons  and  Fleischmann 
described  reproducible  observations  of  extreme 
excess  heat  production  in  electrolysis  experiments. 


with  a  specific  power  generation  rate  of  more  than 
3kW/cm3  of  Pd  cathode.^^  This  high  level  of  heat 
production  is  sufficient  to  boil  away  the  electrolyte 
(50  cc)  completely  in  about  10  minutes.  Energy 
production  was  monitored  with  their  open-cell 
calorimeter,  and  gave  a  result  that  was  consistent 
with  an  energy  balance  determination  by  multiplying 
the  heat  of  vaporization  of  the  electrolyte  (about  41 
kJ  per  mole)  times  the  number  of  moles  of 
electrolyte  (2.5  moles),  yielding  about  100  kJ  of 
energy  output.  The  electrical  energy  input  during 
this  period  was  about  20  kJ,  resulting  in  a  power 
gain  of  about  4.  The  anomalous  excess  energy 
production  during  this  period  was  close  to  200  eV 
per  atom  of  Pd. 

We  have  continued  to  explore^^  theoretical  mech¬ 
anisms  which  could  account  for  the  heat  anomaly, 
and  for  other  anomalies  (including  tritium  and  *He 
production,  and  also  gamma,  neutron  and  fast  ion 
emission)  which  continue  to  be  reported  by 
numerous  laboratories. 

We  have  considered  several  different  types  of  pos¬ 
sible  nuclear  reaction  mechanism;  fusion  as  a 
mechanism  appears  to  be  unlikely  since  there  does 
not  appear  to  be  any  practical  way  to  get  deuterium 
nuclei  close  enough  to  fuse  at  room  temperature  in 
a  metal  lattice.  We  have  therefore  proposed  a  new 
and  speculative  type  of  nuclear  reaction  involving 
the  transfer  of  neutrons  from  "donor"  nuclei  to 
"acceptor"  nuclei. 

During  the  past  year  we  have  made  progress 
towards  understanding  reactions  in  which  a  neutron 
is  transfered  from  deuterium  and  then  captured 
incoherently  by  a  heavy  metal  nucleus,  with  no 
energy  exchange  with  the  lattice.^* 
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Figure  9.  Schematic  of  a  two-step  reaction  involving  a 
virtual  intermediate  state. 

This  type  of  two-step  reaction  involves  a  virtual 
intermediate  state  that  contains  a  free  neutron.  It  is 
known  that  virtual  neutrons  in  free  space  do  not 
stray  very  far  away  from  their  point  of  origin.  The 
Green's  function  for  a  virtual  neutron  in  free  space 
satisfies 

[E-Ho]G(r|ro)-8V-ro)  (1) 

In  this  case  Ho  -  -  and  the  solution  can  be 

found  analytically  to  be 

G(r|ro) - r - (2) 

4n|r-rol  ” 

where  a  -  ^[^\  E  for  E  negative.  The  Green's 
function  is  damped  exponentially,  with  a  decay  con¬ 
stant  that  is  on  the  order  of  an  inverse  fermi. 
Because  of  this,  one  would  naively  expect  that  a 
virtual  neutron  would  never  be  able  to  go  as  far  as 
a  neighboring  nucleus. 

We  have  explored  the  modifications  in  the  virtual 
neutron  Green's  function  that  would  be  brought 
about  by  the  presence  of  a  periodic  produced  by  a 
lattice.  In  this  case,  the  Green's  function  satisfies 

[E-Ho-V(r)]G(r|ro)-8V-ro)  (3) 

where  the  periodic  potential  V(r)  is  due  to  neigh¬ 
boring  nuclei  in  the  lattice.  We  have  shown  that  the 
presence  of  the  periodic  potential  gives  rise  to  a 
long  range  component  of  the  neutron  Green's  func¬ 
tion.  In  the  case  of  a  weak  perturbation,  we  have 
found  a  very  accurate  approximate  result  for  the 


Green's  function  which  exhibits  explicitly  long  range 
effects  that  are  on  the  scale  of  the  characteristic 
Bragg  extinction  length  (which  can  be  on  the  order 
of  a  micron). 

Unfortunately,  the  potential  produced  by  nuclei  in  a 
lattice  will  normally  be  too  weak  to  induce  very 
much  of  an  effect  with  being  resonantly  enhanced. 
Such  a  resonant  effect  has  recently  been  identified: 
a  virtual  neutron  that  is  transfered  from  a  deuteron 
will  be  resonantly  captured  by  a  proton  in  an  equiv¬ 
alent  site  to  make  another  deuteron.  By  itself,  this 
leads  only  to  a  weak  correction  of  the  Green's  func¬ 
tion;  but  if  a  rather  specific  type  of  phase  correlation 
in  space  were  to  exist  among  the  deuterons  and 
protons  in  the  lattice,  then  a  rather  dramatic 
enhancement  of  this  new  long  range  effect  will  take 
place. 

We  have  conjectured  that  diffusion  of  hydrogen 
isotopes  in  the  lattice  would,  in  the  quantum  limit, 
produce  the  required  coherence.  Very  recently  we 
have  obtained  an  existence  proof  of  sorts  that  it  is 
possible  for  a  metal  hydride  to  exhibit  predomi¬ 
nantly  coherent  tunneling  near  room  temperature. 
It  remains  to  be  demonstrated  that  quantum  dif¬ 
fusion  produces  Dicke  states;  this  is  an  area  of 
ongoing  research. 

One  proposed  model  for  heat  production  involves 
neutron  transfer  reactions  in  which  the  reaction 
energy  is  coupled  into  the  lattice.  In  last  year's 
RLE  report  we  described  an  approach  based  on  the 
Duschinsky  operator  which  appears  because  the 
initial  lattice  phonon  modes  differ  from  the  final 
state  phonon  modes  due  to  the  mass  change  at  the 
site  of  the  neutron  transfer;  the  method  proposed  in 
that  report  was  analyzed  in  detail  and  found  to  be 
incapable  of  significant  energy  transfer.  It  was 
found  that  the  Duschinsky  operator  is  not  partic¬ 
ularly  efficient  at  creating  phonons,  but  it  appears  to 
be  capable  of  transfering  energy  through  shifting 
the  frequency  of  phonon  modes. 

Dilute  impurities  in  a  metal  lattice  can  form  reason¬ 
ably  precise  phonon  bands,  termed  impurity  bands 
in  the  literature.  If  there  are  Nh  interstitial  hydrogen 
atoms  and  No  interstitial  deuterium  atoms  in  a  Pd 
lattice,  then  there  will  be  3Nh  impurity  modes  at  wh 
and  3Nd  impurity  modes  at  wo-  If  a  neutron  trans¬ 
fers  off  of  a  deuteron  to  make  a  proton,  then  the 
three  highest  deuterium  impurity  modes  will  be 
shifted  to  the  hydrogen  impurity  band.  The  associ¬ 
ated  energy  exhange  with  the  lattice  will  be  n 
h((OH  -  u)o)-  Since  the  impurity  band  modes  are  true 
continuum  modes,  the  number  of  phonons  in  a 
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single  mode  can  be  quite  large,  and  the  energy 
transfer  can  be  anomalously  larged® 

The  mechanism  described  above  has  the  potential 
to  couple  energy  even  if  the  virtual  neutron  capture 
is  incoherent.  The  anomalous  energy  transfer 
associated  with  neutron  capture  increases  the 
energy  available  for  possible  nuclear  reactions, 
which  can  increase  the  energy  of  reaction  products 
and  open  new  reaction  channels.  Such  reactions 
may  account  for  recent  observations  of  copious 
emissions  of  gammas,  alphas  and  neutrons  from 
glow  discharge  experiments.^® 


1.23  Deuterated  Metal  Experiments 

Sponsors 
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Professor  Peter  L.  Hagelstein,  Professor  Louis  D. 
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If  anomalous  effects  actually  do  occur  in  metal- 
deuterium  systems,  then  it  should  be  possible  to 
develop  experiments  at  MIT  which  demonstrate  one 
or  more  of  the  anomalies.  A  controversy  has 
existed  since  1989  as  to  whether  there  exist  any 
anomalous  effects  at  all,  with  the  large  majority  of 
the  scientific  community  being  currently  in  agree¬ 
ment  that  no  new  effects  exist  at  all.  The  demon¬ 
stration  at  MIT  of  one  or  more  of  the  anomalies 
(excess  heat  production,  tritium  production,  or  ener¬ 
getic  nuclear  products)  would  be  expected  to 
impact  this  controversy. 

A  primary  goal  of  our  experimental  effort  is  there¬ 
fore  to  develop  an  experiment  that  demonstrates  at 
least  one  anomalous  effect,  for  the  express  purpose 
of  placing  research  in  this  area  into  the  mainstream 
of  science. 

The  experimental  results  that  have  been  obtained 
to  date  in  other  laboratories  have  not  clarified  what 
reaction  mechanisms  are  involved  in  any  positive 
sense.  A  major  goal  of  the  present  effort  is  to  work 
towards  a  basic  understanding  of  the  underlying 


reaction  mechanism,  and  to  provide  experimental 
feedback  for  proposed  theoretical  mechanisms. 

There  are  a  number  of  experiments  which  have 
been  done  previously  for  which  a  confirmation  at 
MIT  would  prove  interesting.  Additionally,  there  are 
new  experiments  that  are  motivated  by  theoretical 
considerations  which  could  be  attempted.  The 
experimental  program  that  we  are  currently  pur¬ 
suing  includes  both  classes  of  experiment. 

Our  initial  efforts  towards  developing  experiments 
originated  from  theoretical  ideas  based  on  neutron 
transfer  reaction  model.  As  discussed  above,  the 
model  is  based  on  neutron  transfers  from  donor 
nuclei  (deuterium)  to  suitable  acceptor  nuclei.  Heat 
production  in  the  model  follows  from  coherent 
neutron  transfer  from  deuterium  to  lithium  or  boron; 
tritium  production  arises  from  coherent  neutron 
capture  on  deuterium;  gamma  and  alpha  production 
follows  from  semi-coherent  capture  on  metal  nuclei; 
neutron  and  energetic  ion  emission  would  follow 
from  neutron  capture  on  nuclei  accompanied  by  sig¬ 
nificant  energy  input  from  the  lattice. 

According  to  theory,  significant  relative  phase 
coherence  among  pairs  of  interstitials  in  the  lattice 
must  be  present  before  any  anomalies  can  occur. 
For  such  relative  phase  coherence  to  be  present, 
the  interstitials  must  diffuse;  the  specifics  of  this 
process  are  under  investigation  theoretically.  Metal 
deuterides  are  favored,  since  deuterium  is  the 
lightest  interstitial  that  can  donate  a  neutron. 
Palladium  is  favored  since  the  diffusion  coefficient 
of  deuterium  is  one  of  the  highest  for  an  fee  lattice; 
but  theory  would  suggest  that  V,  Nb  and  Ta,  which 
are  bcc  lattice  and  have  lower  diffusion  barriers, 
may  also  prove  interesting. 

From  theory,  diffusion  in  the  quantum  limit  will  have 
the  best  chance  of  setting  up  the  required  phase 
coherence.  A  review  of  the  experiments  showing 
positive  results  indicate  that  most  of  them  operate 
at  temperatures  somewhat  (within  a  factor  of  2) 
above  where  the  diffusion  is  in  the  quantum  limit. 
The  faster  the  diffusion,  the  larger  the  coherence 
volume,  hence  the  reactions  should  work  better  at 
higher  temperatures  up  to  a  maximum  temperature 
at  which  the  phase  differences  accumulated 
between  hydrogen  and  deuterium  diffusion  destroy 
the  coherence.  Where  this  occurs  is  unknown:  the 
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early  Karabut  paper  describes  a  cut-off  at  500  K,” 
Fleischmann  and  Pons  describe  high  heat  genera¬ 
tion  with  a  temperature  excursion  up  to  about  600 
and  Liaw  reported  excess  heat  in  molten  salt 
experiments  near  700  K7®  The  solubility  of  Pd 
decreases  markedly  at  higher  temperature,  which 
confuses  the  issue.  In  Nb,  with  higher  solubility  at 
elevated  temperatures,  effects  have  been  reported 
at  a  temperature  in  excess  of  1600  K.®® 

Consequently,  higher  temperature  appears  to  be 
favorable  for  the  anomalies.  Inasmuch  as  this 
leads  to  delocalization  of  the  metal  interstitials  and 
maintains  relative  phase  coherence,  it  is  consistent 
with  theory. 

The  first  experiments  which  we  set  out  to  explore 
involved  tritium  production.  From  a  mechanistic 
point  of  view,  tritium  production  is  simpler  than  heat 
production  since  only  light  hydrogen  interstitials  are 
diffusing.  The  diffusion  of  heavier  interstitials  such 
as  Li  and  B  in  PdD  is  not  well  understood,  but  it  is 
clear  that  the  associated  diffusion  coefficients  will 
be  smaller.  Tritium  production  is  mechanistically 
more  complicated  than  semicoherent  alpha  or 
gamma  production,  since  the  lattice  has  to  partic¬ 
ipate  in  a  nontrivial  way;  going  into  the  experiment 
we  are  not  guaranteed  that  the  lattice  will  be  cor¬ 
rectly  excited  to  take  up  the  reaction  energy. 

Observations  of  tritium  production  have  been 
reported  in  numerous  metal  deuterides  in  a  variety 
of  electrochemical,  gas  loading,  and  glow  discharge 
experiments.  One  such  experiment  was  developed 
at  LANL  by  Tom  Claytor.®’  In  this  experiment, 
deuterium  gas  is  absorbed  by  palladium  that  is  part 
of  a  stack  of  alternating  Si  and  Pd  layers;  a  current 
is  driven  through  the  stack  by  pulses  of  high 
voltage.  Tritium  is  reportedly  produced  under  these 
conditions  reproducibly  at  a  rate  of  10®-  10^  tritium 
atoms  per  second.  Tritium  production  was  also 
studied  by  Lanza,  who  assessed  the  relative  rates 


for  different  metal  host  lattices.®^  Tritium  production 
was  reportedly  observed  in  Ti,  Zr,  Hf,  Ta,  Ti-Zr  and 
in  Zircaloy  2;  the  highest  production  figures  were 
found  for  Ta. 

A  gas  ionization  gauge  has  been  developed  that  is 
capable  of  detecting  ionizing  radiation  at  low  levels; 
this  instrument  is  sufficiently  sensitive  to  detect 
about  10'®  tritium  atoms,  which  would  be  produced 
in  about  20  minutes  of  operation  at  a  rate  of  10^ 
atoms/second  (the  maximum  rate  reported  by 
Claytor). 

The  first  experiments  completed  to  date  involved  a 
search  for  tritium  evolved  from  gas-loaded  Ta  using 
the  ionization  gauge  as  a  tritium  detector;  no  tritium 
was  observed  in  this  experiment.  Follow-on  exper¬ 
iments  are  planned  that  will  test  for  tritium  pro¬ 
duction  in  V,  Nb,  Ta,  and  Pd  under  conditions  in 
which  (1)  it  can  be  verified  that  the  metal  loaded; 
(2)  current  flow  is  present;  and  (3)  increased  tem¬ 
peratures  are  maintained. 

One  confirmation  experiment  that  is  planned  is  a 
replication  of  the  Claytor  experiment,  to  be  carried 
out  in  collaboration  with  the  LANL  team.  Recently, 
Claytor  has  reported  success  in  producing  tritium  in 
a  PdD,  wire  loaded  in  deuterium  gas  driven  by  a 
pulsed  current  source.  We  are  modifying  our 
system  to  operate  in  this  mode. 

A  second  type  of  experiment  that  we  have  been 
developing  is  an  attempt  to  obtain  excess  heat  in  a 
gas-loading  experiment.  The  basic  idea  is  to 
assemble  donor  (deuterium)  and  acceptor  (boron) 
nuclei  in  a  host  metal  lattice  that  is  gas  loaded  and 
at  high  temperature.  If  successful,  this  type  of 
system  would  be  better  adapted  for  potential  appli¬ 
cations  than  excess  heat-producing  electrochemical 
systems,  both  because  it  is  technically  simpler  and 
because  energy  conversion  from  a  high  temper¬ 
ature  source  is  more  efficient. 
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An  experiment  has  been  constructed  to  carry  out 
this  type  of  experiment.  It  consists  of  a  heater  that 
is  surrounded  by  a  deuterium  gas  filled  annulus  in 
which  the  metal  sample  is  held  and  monitored.  The 
central  heater  and  annulus  are  surrounded  by 
magnesium  oxide  thermally  resistant  bricks:  due  to 
the  general  appearance  of  the  resulting  structure,  it 
has  earned  the  moniker  "Lenin's  Tomb."  As  a 
calorimeter,  Lenin’s  Tomb  has  been  calibrated  and 
has  been  found  to  be  reproducible  from  day-to-day 
while  heating;  power  excesses  on  the  order  of  a 
watt  would  be  detectable.  In  the  initial  trial  runs  of 
undoped  vanadium  (no  boron)  in  deuterium,  no 
excess  heat  has  been  observed  to  date. 

From  theory,  gamma  and  alpha  emission  due  to 
semi-coherent  neutron  capture  on  metal  nuclei 
should  occur  as  a  precursor  and  during  coherent 
reactions.  This  is  of  interest  since  the  fully 
coherent  reactions  are  difficult  to  diagnose  unless 
they  work  at  high  levels.  From  a  theoretical  point  of 
view,  the  semi-coherent  reactions  are  simpler  since 
they  do  not  require  any  special  degree  of  lattice 
excitation. 

We  have  used  a  Nal  gamma  detector  to  search  for 
excess  gamma  emission  from  gas  loaded  vanadium 
in  Lenin's  Tomb,  as  a  signature  of  precursor 
activity.  The  background  is  very  high,  and  no  posi¬ 
tive  signals  were  identifiable. 

A  next  step  in  the  experiment  is  to  arrange  for 
current  flow  in  the  deuterated  metal  samples,  and 
to  diagnose  for  gamma  emission  in  V,  Nb,  Ta  and 
Pd.  Some  effort  needs  to  be  put  into  improving  the 
signal  to  noise  ratio  in  the  gamma  detection.  The 
positive  observation  of  a  precursor  signal  would  be 
a  significant  step  forward  experimentally,  and  would 
bode  well  for  possible  observations  of  an  excess 
power  anomaly.  Experiments  on  boron-implanted 
gas  loaded  metals  constitute  the  next  phase  of  the 
experiments. 


We  had  worked  for  about  a  year  on  the  exper¬ 
iments  described  above  when  we  learned  of  some 
closely  related  glow-discharge  experiments  that 
were  reported  at  the  International  Cold  Fusion  Con¬ 
ference  in  Nagoya  last  November.  In  these  exper¬ 
iments,  various  metals  were  loaded  by  glow 
discharge  at  high  temperature  and  anomalies  were 
obsen/ed.  One  group  at  the  LUTCH  Association  in 
Moscow  observed  gammas,  alphas  and  neutrons  at 
very  high  levels  10^-10®  per  second  in  glow  dis¬ 
charge  experiments  using  a  Pd  cathode;  very  high 
levels  of  excess  power  (more  than  1  kilowatt/cm^) 
were  also  obsen/ed.®®  Another  experiment  at 
LUTCH  used  a  similar  glow  discharge  to  study 
tritium  production  at  high  temperature  in  a  variety  of 
metals:  tritium  production  at  the  very  high  rate  of 
1.7x10®  atoms/second  was  obsen/ed  in  Nb  at 
1170  K.®<  Glow  discharge  experiments  in  China 
gave  intense  neutron  and  gamma  signals  from  a 
variety  of  metals,  including  Pt,  Nb,  W  and  Pd.®® 

These  experiments  are  closely  related  in  that  metal 
deuterides  at  high  temperature  exhibit  positive  pre¬ 
cursor  signals,  as  well  as  heat  and  tritium  pro¬ 
duction.  They  differ  from  the  experiments  that  we 
have  been  developing  in  that  the  cathodes  are 
actively  loaded  by  the  discharge  and  achieve  very 
high  D/Pd  loading  ratios,  and  also  that  substantial 
current  density  is  present  in  the  cathodes.  We 
decided  to  attempt  a  confirmation  of  one  of  the 
glow  discharge  experiments  performed  at  LUTCH. 

We  have  developed  a  glow  discharge  that  runs 
under  conditions  similar  to  those  reported  by  the 
LUTCH  group,  in  terms  of  operating  current,  voltage 
and  gas  pressure.  This  discharge  is  unusual  in  that 
the  cathode  surface  current  density  can  reach 
several  hundred  milliamps/cm®;  this  high  surface 
current  level  is  accompanied  by  high  cathode  tem¬ 
peratures  which  appears  to  be  important.  This 
experiment  will  be  carried  out  in  collaboration  with 
Y.  Kucherov  of  the  LUTCH  group. 


83  A.B.  Karabut,  Y.R.  Kucherov,  and  I.B.  Sawatimova,  ’Nuclear  Product  Ratio  for  Glow  Discharge  in  Deuterium,"  Phys.  Lett.  A  170:  265 
(1992). 

8<  V.A.  Romodanov,  V.l.  Savin,  and  Y.M.  Timofeev,  'Nuclear  Fusion  in  Condensed  Matter,"  LUTCH  report  UDC:539.172.13;  presented 
at  the  Third  Annual  International  Cold  Fusion  Conference,  Nagoya,  Japan,  November  1992. 

85  L.  Heqing  et  al.,  presented  at  the  Third  Annual  International  Cold  Fusion  Conference,  Nagoya,  Japan,  November  1992. 
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2.1  Introduction 

The  central  theme  of  our  programs  has  been  to 
advance  the  understanding  of  optical  and  quasi- 
optical  communication,  radar,  and  sensing  systems. 
Broadly  speaking,  this  has  entailed:  (1)  developing 
system-analytic  models  for  imiMrtant  optical  propa¬ 
gation,  detection,  and  communication  scenarios;  (2) 
using  these  models  to  derive  the  fundamental  limits 
on  system  performance;  and  (3)  identifying  and 
establishing  through  experimentation  the  feasibility 
of  techniques  and  devices  which  can  be  used  to 
approach  these  performance  limits. 


2.2  Squeezed  States  of  Light 
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2.2.1  Experiments 

We  have  employed  optical  parametric  downconver- 
sion  in  a  type-1  phase-matched  LiNbOaiMgO  crystal 
in  our  efforts  to  generate  nonclassical  light.  Our 
focus  has  been  on  the  demonstration  of  squeezed 
amplification  of  a  below-threshold  optical  parametric 
amplifier  (OPA)  in  the  gain-saturated  regime.^ 
Under  gain  saturation,  the  amplified  output  intensity 
of  an  injected  coherent-state  signal  becomes  ampli¬ 
tude  squeezed,  and  the  large-signal  signal-to-noise 
ratio  improves.  Using  a  single-frequency  diode- 
pumped  YAG  laser  as  the  injection  source,  we  have 
made  extensive  mean-field  measurements:  gain 
saturation  and  output  lineshape  have  been  studied 
as  functions  of  the  pump  and  signal  powers,  as 
well  as  the  signal  frequency,  pump  frequency  and 
cavity  detunings.^  We  have  found  quantitative 
agreement  between  theory  and  experiment  for  the 
frequency  spacings  and  widths  of  the  key  features 
in  these  measurements.  We  have  observed  an 
unsaturated  signal-gain  of  10^  for  an  OPA  pumped 
at  97  percent  of  threshold  with  a  low  level  of  signal 
injection.  This  gain  is  reduced  to  13  at  the  same 
pump  level  when  the  signal  injection  is  increased  by 


'  Laboratory  for  Physical  Sciences,  College  Park,  Maryland. 

2  N.C.  Wong,  “Squeezed  Amplification  in  a  Nondegenerate  Parametric  Amplifier,'  Qpf.  Lett.  16(21);  1698-1700  (1991). 

3  K.X.  Sun,  N.C.  Wong,  and  J.H.  Shapiro,  “Squeezed  Amplification  with  Cavity  Detunings:  Mean-Field  Behavior,’  paper  presented  at 
the  Optical  Society  of  America  Annual  Meeting,  Albuquerque,  New  Mexico,  September  20-25, 1992. 
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33  dB.  Potential  applications  of  an  injection-seeded 
gain-saturated  OPA  include  master-oscillator  output 
amplification  and  direct-detection  digital  communi¬ 
cation. 


2.2.2  Theory 

A  detailed  analysis  of  the  injection-seeded  OPA 
including  cavity  detunings  has  been  carried  out.^ 
Both  mean  field  and  noise  behavior  of  the  gain- 
saturated  OPA  and  their  dependence  on  cavity 
detunings,  pump  power,  and  signal  injection  power 
have  been  obtained  and  used  for  comparison  with 
our  experiments.  Direct-detection  statistics  have 
been  obtained  using  the  quantum  analog  of  the 
Gaussian  moment-factoring  theorem,  showing  that 
the  beat  noise  will  be  insignificant  in  our  squeezed- 
amplification  experiments.^  We  have  also  developed 
a  two-detector  extension  of  the  theory  that  permits 
efficient  and  insightful  analysis  of  recent  fourth- 
order  interference  experiments®  which  use  entan¬ 
gled  photons  from  a  parametric  downconverter. 

We  have  continued  our  study  of  the  fundamental 
quantum  limits  on  the  measurement  of  phase.  Our 
early  work  concentrated  on  the  single-mode  case,^ 
in  which  we  showed  that  the  Susskind-Glogower 
probability  operator  measure  (SG-POM)  provides 
the  maximum-likelihood  (ML)  estimate  of  a 
c-number  phase  shift,  and  we  explored  a  variety  o1 
quantum  states  for  use  in  conjunction  with  this  ML 
procedure.  Recently,  we  have  concentrated  on  the 
behavior  of  two-mode  phase  measurements  which 
relieve  the  burden  of  the  single-mode  case's  Paley- 
Wiener  constraint.  Here  we  have  related  quantum 
phase  to  angular  momentum  formalism®  and  eluci¬ 
dated  a  system  for  error-free,  phase-conjugate 
communication  or  finite-precision  phase-sensing.® 
The  latter  study  casts  new  light  on  the  two-mode 


ML  estimation  problem,  viz.,  it  appears  that  there  is 
no  general  ML  phase-measurement  POM  for  arbi¬ 
trary  input  states. 

Finally,  we  have  continued  and  extended  our  the¬ 
oretical  work  on  quantum  propagation  in  single¬ 
mode  fiber.’®  We  now  have  a  formalism  which 
includes  the  Kerr  effect,  i.e.,  self-phase  modulation 
(SPM),  plus  linear  loss,  and  group-velocity  disper¬ 
sion.  This  construct  reduces  properly  to  the  stan¬ 
dard  results  of  classical  theory  and  linearized 
quantum  theory.  Moreover,  we  have  already  per¬ 
formed  numerical  studies  which  establish  the 
validity  limits  of  the  linearized  quantum  theory  for 
pure  SPM,  and  for  SPM  plus  loss,  and  we  are  pro¬ 
ceeding  with  the  richer  case  of  SPM  plus  disper¬ 
sion. 


2.2.3  Publications 

Joneckis,  L.G.,  and  J.H.  Shapiro.  “Quantum  Propa¬ 
gation  in  a  Kerr  Medium;  Lossless,  Dispersion¬ 
less  Fiber.”  J.  Opt.  Soc.  Am.  B.  Forthcoming. 

Shapiro,  J.H.  “On  the  Performance  of  Optical 
Phase-Shift-Keying  with  Preamplify- 
Feedfonward-Homodyne  Reception."  IEEE 
Photon.  Technol.  Lett.  4(6):  647-649  (1992). 

Shapiro,  J.H.,  and  K.X.  Sun.  “Direct  Detection  of 
Gaussian-State  Light.”  Paper  presented  at  the 
Optical  Society  of  America  Annual  Meeting, 
Albuquerque,  New  Mexico,  Septembt-  20-25, 
1992. 

Shapiro,  J.H.,  and  N.C.  Wong.  “Optical  Amplifiers, 
Phase-Shift  Keying,  and  Signal-to-Noise  Ratio.” 
Paper  presented  at  the  Optical  Society  of 


*  K.X.  Sun,  N.C.  Wong,  and  J.H.  Shapiro,  “Squeezed  Amplification  with  Cavity  Detunings;  Mean-Field  Behavior,"  paper  presented  at 
the  Optical  Society  of  America  Annual  Meeting,  Albuquerque,  New  Mexico,  September  20-25,  1992;  K.X.  Sun  and  J.H.  Shapiro, 
"Squeezed  Amplification  with  Cavity  Detunings;  Noise  Behavior,"  paper  presented  at  the  Optical  Society  of  America  Annual  Meeting, 
Albuquerque,  New  Mexico,  September  20-25, 1992. 

5  J.H.  Shapiro  and  K.X.  Sun,  "Direct  Detection  of  Gaussian-State  Light,"  paper  presented  at  the  Optical  Society  of  America  Annual 
Meeting,  Albuquerque,  New  Mexico,  September  20-25, 1992. 

6  A.M.  Steinberg,  P.G.  Kwiat,  and  R.Y.  Chiao,  Phys.  Rev.  Lett.  68(16);  2421-2424  (1992). 

7  J.H.  Shapiro  and  S.R.  Shepard,  “Quantum  Phase  Measurement;  A  System  Theory  Perspective,"  Phys.  Rev.  A  43(7);  3795-3817 
(1991). 

8  S.R.  Shepard,  Phase  of  the  Quantum  Harmonic  Oscillator,  Ph.D.  diss..  Dept,  of  Electr.  Eng.  and  Comput.  Sci.,  MIT,  1992. 

9  J.H.  Shapiro,  "Phase  Conjugate  Quantum  Communication  with  Zero  Error  Probability  at  Finite  Average  Photon  Number,"  submitted  to 
Phys.  Script. 

10  l.G.  Joneckis  and  J.H.  Shapiro.  “Quantum  Propagation  in  a  Kerr  Medium;  Lossless,  Dispersionless  Fiber,"  J.  Opt.  Soc.  Am.  B, 
forthcoming. 
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2.3  Optical  Frequency  Division  and 
Synthesis 
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Frequency  division  and  synthesis  in  the  optical 
domain  play  an  important  role  in  modern  optical 
precision  measurements,  optical  frequency  stan¬ 
dards,  and  optical  communication.  We  have  dem¬ 
onstrated  optical  frequency  division  using  an  optical 
parametric  oscillator  (OPO)  approach  based  on  an 


efficient,  one-step  parametric  downconversion 
process.”  An  OPO  converts  an  input  pump  into  two 
intense,  coherent  subharmonic  outputs  whose  fre¬ 
quencies  are  tunable  and  whose  sum  frequency 
equals  the  pump  frequency.  By  phase  locking  the 
output  frequency  difference  to  a  microwave  source, 
the  output  frequencies  are  precisely  determined, 
and  the  OPO  functions  as  an  optical  frequency 
divider.  OPO  frequency  dividers  can  be  operated 
in  series  or  in  parallel  to  measure,  compare,  and 
synthesize  frequencies  from  optical  to  microwave 
with  high  precision  and  resolution. 


2.3.1  Demonstration  of  Optical  Frequency 
Division 

We  have  employed  a  type-ll  phase-matched  KTP 
OPO  for  demonstrating  the  concept  of  optical 
parametric  division.  A  three-element  cavity  design 
consisting  of  the  KTP  crystal  and  two  cavity  mirrors 
has  been  constructed.  The  mirrors  are  rigidly 
attached  to  a  spacer  block  to  provide  excellent 
mechanical  stability.  This  cavity  design  permits 
systematic  and  continuous  frequency  tuning  of  the 
outputs  and  stable  operation  of  the  OPO.  We  have 
successfully  demonstrated  frequency  division  in  the 
optical  regime  by  phase  locking  the  two  OPO  sub¬ 
harmonic  outputs  at  1.06  urn  to  a  microwave  syn¬ 
thesized  signal  (2-25  GHz)  with  a  residual  beat-note 
linewidth  of  25  mHz,  limited  by  the  resolution  of  the 
spectrum  analyzer.'^  This  indicates  that  the  OPO 
has  very  low  phase  noise,  which  can  be  further 
suppressed  by  phase  locking  it  to  a  microwave 
source.  This  exceptionally  high  spectral  purity 
should  also  allow  us  o  study  the  quantum  phase 
diffusion  noise  of  an  OPO. 


2.3.2  High-Frequency 
Phase-Velocity-Matched  Electro-Optic 
Modulator 

In  order  to  facilitate  phase  locking  of  the  OPO 
outputs  with  a  difference  frequency  of  -1  THz  or 
more,  an  optical  frequency  comb  with  a  large  fre¬ 
quency  span  is  required.  We  have  developed  a 
new  electro-optic  modulator  that  is  capable  of  gen¬ 
erating  hundreds  of  sidebands  at  a  spacing  of 
20-40  GHz.  By  incorporating  a  microwave  wave¬ 
guide  structure  in  a  LiNbOa  electro-optic  modulator. 


”  N.C.  Wong,  "Optical  Frequency  Division  using  an  Optical  Parametric  Oscillator,"  Opt.  Lott.  15(20);  1129-1131  (1990);  D.  Lee  and 
N.C.  Wong,  "Tunable  Optical  Frequency  Division  using  a  Phase-Locked  Optical  Parametric  Oscillator,"  Opt  Lett.  17(1):  13-15  (1992). 

'2  D,  Lee  and  N.C.  Wong,  "Stabilization  and  Tuning  of  a  Doubly  Resonant  Optical  Parametric  Oscillator,"  submitted  to  J.  Opt.  Soc.  Am. 
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the  phase  velocities  of  the  microwave  and  optical 
fields  can  be  matched  to  maximize  the  interaction 
length. vve  have  been  able  to  obtain  a  single-pass 
modulation  index  of  0.4  with  1  W  of  rf  power  at  a 
driving  frequency  of  17.2  GHz.  Our  next  step  is  to 
place  the  modulator  inside  an  optical  resonator  to 
increase  the  effective  interaction  length,  thus 
making  it  possible  to  generate  sidebands  with  a 
span  of  over  1  THz.  Preliminary  results  in  a  12.4 
GHz  modulator  shows  a  one  percent  output  power 
at  the  fifth  order  sideband  at  62  GHz.'^  In  addition 
to  optical  frequency  division,  THz  optical  sideband 
generation  is  potentially  useful  for  frequency  identifi¬ 
cation  in  a  wideband  optical  communication 
network. 


2.3.3  Optical  Frequency  Counter 

We  have  proposed  an  optical  frequency  counter 
that  is  capable  of  measuring  any  optical  frequency 
from  the  IJV  to  the  near  IR  relative  to  a  microwave 
frequency  standard. The  concept  is  to  measure 
the  frequency  difference  of  two  known  ratios  (1/2 
and  2/3)  of  an  optical  frequency  /  relative  to  the 
cesium  clock.  By  employing  a  parallel  network  of 
phase-locked  OPOs  and  wideband  modulators  to 
link  the  (1/2)/  and  (2/3)/  frequencies,  a  precise  and 
accurate  optical  frequency  comb  can  be  provided  in 
the  -1-2-jim  wavelength  region.  By  using  this 
comb,  most  optical  frequencies  from  the  UV  to  the 
near  IR  can  be  measured  or  synthesized.  We  have 
begun  an  experimental  program  to  implement  this 
frequency  counter.  Using  a  Ti:sapphire  laser  at 
-775  nm  (/=387  THz)  as  the  pump  source,  our 
initial  efforts  will  be  to  demonstrate  a  2:1  OPO  fre¬ 
quency  divider  with  subharmonic  outputs  at  1 .5  ^m, 
which  is  also  useful  as  an  alternative  source  for 
fiber-optic  communication. 
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The  experimental  portion  of  this  program  ended  in 
December  1991,  and  the  program  terminated  offi¬ 
cially  when  we  submitted  the  final  report  to  the 
Office  of  Naval  Research  in  August  1992.  During 
1992,  we  expanded  the  theory  of  an  optical  maser 
first  proposed  by  W.E.  Lamb  in  1964,  using  a  trav¬ 
eling  wave  model  to  describe  our  experimental 
polarization  results.  The  experimental  results  on 
spatial  coherence  were  also  more  fully  explained. 


13  B.Y.  Lee,  T.  Kobayashi,  A.  Morimoto,  and  T.  Sueta,  "Picosecond  Electro-Optic  Modulator/Deflector  with  Velocity  Matching."  paper 
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1992,  p.  32;  N.C.  Wong,  D.  Lee,  and  L.R.  Brothers,  "Tunable  Optical  Frequency  Division,"  Digest  ot  the  International  Quantum  Elec¬ 
tronics  Conlerence,  Vienna,  Austria,  June  1992,  p.  110. 
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Because  this  program,  which  was  considered  a 
very  long  shot  when  started  in  1980  under  prede¬ 
cessor  grant  number  N00014-80-C-0941,  was 
becoming  of  commercial  interest,  in  1991  the  prin¬ 
cipal  investigator  requested  that  the  Navy  project 
officer  not  renew  the  grant.  The  last  sentence  of 
the  final  report  to  the  Navy  reads,  "It  is  hoped  that 
ONR,  by  sponsoring  this  high-risk,  long  shot,  has 
significantly  contributed  to  the  industrial  competitive¬ 
ness  of  our  country." 
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2.5  Analog  Processing  of  Optical 
Wavefronts  Using  Integrated 
Guided'Wave  Optics 
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In  wavefront  sensing  and  correction,  it  is  envisioned 
that  10^-10^  basic  modules  would  be  used.  In 
integrated  optics  as  in  integrated  circuits,  it  is 
important  to  relax  the  requirements  for  individual 
components  while  requiring  that  operation  of  the 
integrated  optics  (circuits)  is  independent  of  signif¬ 
icant  component  variations.  The  wavefront  is 
sensed  by  interferometers  between  the  multiplicity 
of  through  waveguides  with  the  arms  of  the  inter¬ 
ferometers  evanescently  coupled  to  adjacent  wave¬ 
guides.  The  input  powers  to  the  interferometer 
arms  will  not  be  equal  as  a  result  of  (1)  the  input 
power  to  the  waveguide  array  being  non-uniform 


and  (2)  unequal  coupling  by  the  evanescent  cou¬ 
plers. 

In  last  year’s  RLE  Progress  Report  (Number  134), 
we  stated  that  the  experimental  results  on  the 
phase  measurement  and  phase  correction  have 
been  independent  of  the  power  difference  between 
the  inteiferometer  arms  up  to  ratios  of  greater  than 
10:1.  It  has  since  been  experimentally  determined 
that  the  phase  measurement  and  the  phase  cor¬ 
rection  are  independent  of  this  power  imbalance  up 
to  ratios  of  50:1 . 

With  this  successful  result  and  those  from  prior 
years,  it  was  decided  in  1992  to  embark  on  building 
a  complete  basic  optical  module.  The  on-chip 
detector  at  the  output  of  the  interferometer  is  also 
incorporated  in  this  planned  module.  In  the  pre¬ 
vious  work,  the  detector  has  been  located  off-chip. 
The  semiconductor  processing  design  we  devel¬ 
oped  ensures  that  the  components  (e.g.,  wave¬ 
guides,  couplers  and  detectors)  of  the  module  are 
fabricated  to  be  compatible.  GaAs/AIGaAs 
epistructures,  which  are  used  for  making  the 
module,  have  been  specified  and  are  on  hand. 
Individual  processing  steps  have  been  specified 
and  optimized.  We  have  received  the 
photolithography  masks,  which  we  designed. 

In  1 992  the  project  dealt  primarily  with  process  defi¬ 
nition  and  experimentation  necessary  for  ongoing 
module  fabrication.  In  the  early  part  of  1993,  we 
will  complete  the  first  basic  modules,  measure  their 
performance,  and  compare  their  operation  with  the¬ 
oretical  predictions. 
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3.1  Facility  for  Millimeter-wave  and  THz 
Frequencies 

Professor  Hu's  laboratory  is  equipped  with  various 
millimeter-wave  and  infrared  sources  which  can 
generate  coherent  and  incoherent  radiation  up  to 
30  THz.  These  include  Gunn  oscillators  at  W-band 
frequencies  (75-110  GHz);  a  frequency  doubler, 
tripler,  and  quadrupler  using  Schottky  diodes  at 
200,  300,  and  400  GHz;  an  optically  pumped  far- 
infrared  laser  which  generates  coherent  radiation  up 
to  8  THz;  and  an  infrared  Fourier  transform 
spectrometer  which  is  capable  of  performing  linear 
spectroscopy  from  45  GHz  to  30  THz  and  beyond. 
The  laboratory  is  also  equipped  with  various 
cryogenic  millimeter-wave  and  infrared  detectors. 
These  include  Si  composite  bolometers,  InSb  h^'t- 
electron  bolometers,  SIS  (superconductor-insul&ior- 
superconductor)  receivers,  and  high-Te  Josephson 
detectors.  There  are  many  infrared  cryostats  which 
can  cool  the  devices  from  0.3  K  to  77  K. 


3.2  Millimeter-wave  and  Infrared 
Superconducting  Focal-plane  Receiver 
Arrays 

Sponsors 
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Although  the  range  that  includes  millimeter-wave 
and  far-infrared  frequencies  is  one  of  the  most 


underdeveloped,  these  frequencies  have  great 
potential  for  applications  in  remote  sensing  and 
communication.  The  millimeter-wave  and  far- 
infrared  frequency  range  falls  between  the  two 
other  frequency  ranges  in  which  conventional  semi¬ 
conductor  devices  are  usually  operated;  (1)  micro- 
wave  frequency  range  and  (2)  near-infrared  and 
optical  frequency  range. 

Semiconductor  devices  which  utilize  the  classical 
diffusive  transport  of  electrons  such  as  diodes  and 
transistors  have  a  high  frequency  limit.  This  limit  is 
set  by  the  time  it  takes  for  electrons  to  travel  a 
certain  distance.  Currently,  electron  mobility  and  the 
smallest  feature  size  which  can  be  fabricated  by 
lithography  limit  the  frequency  range  to  below  100 
GHz.  It  is  not  likely  that  this  limit  can  be  pushed 
much  higher.  Semiconductor  devices  based  on 
quantum  mechanical  interband  transitions,  however, 
are  limited  to  frequencies  higher  than  those  corre¬ 
sponding  to  the  semiconductor  energy  gap,  which  is 
higher  than  10  THz  for  most  bulk  semiconductors. 
Therefore,  a  large  gap  exists  from  100  GHz  to  10 
THz  in  which  very  few  devices  can  operate. 

The  gap  energies  of  conventional  superconductors 
such  as  Nb  are  in  the  range  of  100  GHz  to  2  THz. 
This  coincidence  makes  superconducting  devices 
natural  candidates  for  millimeter-  and  submillimeter- 
wave  applications.  At  millimeter-wave  frequencies, 
the  superconducting  video  detectors  have  quantum 
efficiency  e/^u),  that  is,  a  transport  of  one  electron 
for  one  incoming  photon;  while  the  superconducting 
coherent  receivers  have  their  sensitivities  limited 
only  by  the  zero-point  fluctuation  of  vacuum.  Such 
receivers  have  been  used  widely  in  astrophysical 
studies.  Additional  applications  are  feasible  in 
space-based  communication  and  far-infrared  spec¬ 
troscopy  which  requires  the  ultimate  sensitivity. 
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We  are  currently  developing  a  novel  quasioptical 
scheme  to  couple  the  millimeter-wave  and  infrared 
signals  to  superconducting  devices  through  a  com¬ 
bination  of  an  etched  horn  antenna  and  a  planar 
antenna  supported  by  a  thin  ( -  1  pm)  membrane, 
as  shown  in  figure  1.  This  scheme  combines  the 
advantages  of  the  easy  fabrication  of  lithographic 
thin-film  structures  and  the  high  antenna  efficiencies 
of  horn  antennas.  Because  of  the  absence  of  sub¬ 
strate  losses  in  this  scheme,  it  is  expected  that  a 
THz  receiver  could  be  constructed  using  all-Nb 
superconductor-insulator-superconductor  (SIS)  junc¬ 
tions.  More  important,  because  of  the  cumbersome 
mechanical  structures  used  in  conventional  wave¬ 
guide  technology,  microwave  receivers  have  been 
single-element  devices,  as  opposed  to  CCD 
imaging  arrays  at  optical  frequencies.  Spatial  scan 
has  been  achieved  mechanically.  Using  the  novel 
quasioptical  scheme  mentioned  above,  focal-plane 
detector  arrays  can  be  fabricated  lithographically  on 
a  single  Si  wafer,  as  shown  in  figure  2,  so  that  far- 
infrared  imaging  becomes  feasible. 

The  first  step  of  this  project  is  to  demonstrate  the 
feasibility  of  fabricating  high-quality  SIS  devices  on 
free-standing  SiN  membranes  and  then  to  demon¬ 
strate  that  these  devices  can  survive  thermal 
cycling.  Using  the  microfabrication  facilities  in 
Group  86  at  MIT  Lincoln  Laboratory,  we  have  fabri¬ 
cated  several  wafers  of  SIS  junctions  with  a  critical 
current  density  of  4,000  A/cm^.  We  then  etched  the 
Si  wafer  underneath  anisotropically  in  a  KOH  sol¬ 
ution.  A  view  of  an  SIS  junction  with  a  log-periodic 
antenna  on  a  1-pm  thick  SiN  membrane  is  shown 
in  figure  3.  Extreme  care  was  taken  to  protect  the 
SIS  junctions  from  the  KOH  etchant  during  the 
anisotropic  etching.  Figure  4  shows  the  l-V  curves 
of  the  SIS  junction  measured  before  and  after  the 
Si  wafer  was  etched  away.  There  are  no  noticeable 
changes  in  the  l-V  characteristics  of  the  device. 
Clearly,  our  work  has  established  the  feasibility  of 
building  millimeter-wave  SIS  receivers  using  micro- 
machined  horn  antenna  structures. 


3.3  Photon-assisted  Quantum 
Transport  in  Quantum  Point  Contacts 
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Figure  1.  Example  of  anisotropic  etching  in  a  <100> 
silicon  wafer.  The  opening  of  the  wafer  yields  a  pyramidal 
horn  with  a  flare  angle  of  35.5°. 


-  Antenna  ^  Membrane  Horn 
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Figure  2.  Perspective  view  of  a  two-dimensional  horn 
imaging  array. 


Quantum  transport  has  been  one  of  the  most  active 
fieds  in  solid-state  physics  in  recent  years. 
Advances  in  material  preparation  have  made 
quantum  phenomena  profound  in  electron  transport 
lof  many  semiconductor  quantum  devices  such  as 
quantum  point  contacts,  quantum  dots,  quantum 
wires,  quantum  wells,  superlattices,  etc.  In  clean 
samples  and  at  low  temperatures,  electrons  can 
travel  through  the  whole  sample  without  suffering 
phase-destructive  scattering.  Extensive  work  has 
been  done  to  study  various  features  of  such  phase- 
coherent  quantum  transport.  However,  so  far  the 
experiments  reported  are  limited  to  dc  transport 
measurements  or  far-infrared  spectroscopy 
measurements. 

It  is  well  known  in  the  field  of  superconducting  tun¬ 
neling  that  photons  can  assist  the  tunneling 
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Figure  3.  Perspectives  on  a  silicon  nitride  membrane 
structure,  (top)  View  of  device  seen  through  the  mem¬ 
brane  using  backside  illumination  on  microscope, 
(bottom)  View  of  same  device  using  top-side  illumination; 
the  70°  angle  of  the  anisotropic  etch  can  be  made  out  by 
noticing  the  dark  rim  around  the  border  of  the  device. 

process,  provided  the  tunneling  is  elastic  so  that 
electrons  do  not  suffer  inelastic  scattering.  In  a 
broad  sense,  elastic  tunneling  is  a  phase-coherent 
quantum  transport  in  a  classically  forbidden  region. 
Therefore,  all  the  results  of  photon-assisted  tun¬ 
neling  can  be  applied  to  the  study  of  photon- 
assisted  quantum  transport  in  semiconductor 
devices.  This  will  provide  a  new  dimension  to  study 
the  exciting  quantum  transport  phenomena.  Novel 
long-wavelength  optoelectronic  devices  may  also 
emerge  from  this  research. 

In  this  NSF-sponsored  project,  we  are  studying  the 
interaction  between  far-infrared  photons  and 
ballistic  electrons  in  quantum  point  contact  devices, 


Figure  4.  DC  l-V  curves  for  a  device  on  SiN  membrane 
(fop)  before  etching,  and  (bottom)  after  etching  the  Si 
water  in  KOH. 


whose  schematic  is  shown  in  figure  5(a).  We  have 
fabricated  several  antenna-coupled  quantum  point 
contact  devices  using  a  combination  of  optical  and 
electron-beam  lithography.  Figure  6  shows  the 
SEM  photographs  of  one  of  the  devices.  The  split- 
gate  electrodes  (the  vertical  leads)  also  sen/e  as 
the  terminals  of  a  log-periodic  antenna,  whose  func¬ 
tion  is  to  couple  the  far-infrared  radiation  of  milli¬ 
meter  wavelength  to  the  point  contact  of  submicron 
dimensions. 

The  dc  transport  measurement  of  the  drain/source 
transport  showed  fifteen  quantized  conductance 
steps,  indicating  the  high  quality  of  the  devices. 
Under  a  coherent  far-infrared  radiation  (285  GHz),  a 
profound  photon-induced  drain/source  current  is 
produced  throughout  a  gate  voltage  range  in  which 
the  device  exhibits  the  behavior  of  a  one¬ 
dimensional  electron  system.  The  amplitude  of  the 
photon-induced  current  is  comparable  to  that  corre¬ 
sponding  to  a  quantized  conductance  step,  and  it 
oscillates  with  the  gate  voltage  with  a  period  corre- 
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Figure  5.  (a)  Schematic  of  a  quantum  point  contact,  (b) 
Schematic  of  an  electron  waveguide  which  can  be  used 
to  analyze  the  transport  of  electrons  through  the  quantum 
point  contact  shown  in  (a). 

spending  to  the  gate  voltage  between  adjacent  con¬ 
ductance  steps,  as  shown  in  figure  7. 

Our  preliminary  analysis  suggests  that  the  observed 
far-infrared  photon-induced  drain/source  current  is 
mainly  due  to  heating  (or  a  bolometric  effect).  Since 
the  excitation  in  the  two-dimensional  electron  gas  is 
gapless,  energy  absorption  can  occur  everywhere 
within  the  electron  gas.  Thus,  the  effect  of  photon- 
assisted  quantum  transport  will  always  be  accom¬ 
panied  by  an  effect  due  to  heating.  This  aspect  is 
quite  different  from  photon-assisted  tunneling  in 
superconducting  tunnel  junctions  in  which  the 
superconducting  energy  gap  prevents  energy 
absorptions  in  the  leads.  Our  analysis  also  sug¬ 
gests  several  improvements  must  be  made  to 
enhance  the  effect  of  photon-assisted  quantum 
transport  and  minimize  the  heating  effect. 
According  to  the  recently  deve'oped  theory  of 
photon-assisted  quantum  transport,  the  photon- 
assisted  process  mainly  occurs  near  the  classical 
turning  points  of  the  subbands  under  concern. 
Thus,  to  enhance  the  photon-assisted  quantum 
process,  it  is  crucial  to  concentrate  the  radiation 
field  in  the  central  region  of  the  point  contacts.  This 
requires  use  of  thick  ( -  2000A)  antenna  terminals 
made  of  good  conductors.  The  quantum  effect  can 
also  be  enhanced  by  irradiating  the  devices  at  fre¬ 
quencies  much  higher  than  300  GHz  so  that  the 
photon  energies  are  much  greater  than  the  thermal 
broadening  of  the  Fermi  surface. 


Figure  6.  (a)  SEM  (with  a  magnification  of  400)  of  a 
quantum  point  contact  with  log-periodic  antenna,  (b) 
Central  region  of  the  quantum  point  contact.  The  opening 
of  the  split-gate  electrodes  is  0.15  jim. 

3.4  High-Tc  Superconducting 
Josephson  Devices 

Sponsor 

Defense  Advanced  Research  Projects  Agency 
Contract  MDA972-90-C-0021 

Project  Staff 

Rajesh  K.  Gupta,  Professor  Qing  Hu,  in  collab¬ 
oration  with  Twente  University,  The  Netherlands 


112  RLE  Progress  Report  Number  135 


Chapter  3.  High-Frequency  Electronic  Devices 


10 


8 


S  * 

f  * 


2 


0 

-2.2  -2  -1.8  -1.6  -1.4  -1.2 

VqsM 


Figure  7.  Drain/source  conductance  and  the  photon- 
induced  drain/source  current  as  functions  of  the  gate 
voltage  in  a  region  where  the  conductance  steps 
(measure  without  radiation)  are  well  defined. 

Many  superconducting  analog  devices  have  been 
demonstrated  to  have  higher  sensitivities,  higher 
speed  and  frequency  limit,  and  lower  power  dissi¬ 
pation  than  competing  semiconductor  devices. 
Among  them,  the  most  successful  ones  are  Super¬ 
conducting  Quantum  Interference  Devices 
(SQUIDS)  and  millimeter-wave  detectors. 

The  discovery  of  superconductors  with  a  supercon¬ 
ducting  transition  temperature  higher  than  liquid 
nitrogen  temperature  (high-Te  superconductors)  has 
opened  up  new  and  exciting  possibilities  in  elec¬ 
tronic  device  technology.  A  high  temperature 
version  of  the  superconducting  devices  mentioned 
above  will  find  a  much  wider  range  of  applications 
wherever  refrigeration  is  a  problem.  The  key 
element  of  all  superconducting  analog  and  digital 
devices  is  the  Josephson  junction.  This  junction  is 
formed  by  two  superconductors  weakly  coupled 
together  electrically.  The  supercurrent  flowing 
through  the  Josephson  junction  oscillates  at  the 
Josephson  frequency  f  =  2eV/h  (500  GHz/mV) 
when  a  voltage  is  applied  across  the  junction.  Most 
of  the  useful  applications  of  Josephson  devices 
result  from  this  high-frequency  oscillation.  These 
devices  include  high-frequency  and  high-speed 
signal  processors  and  high-sensitivity  SQUID  mag¬ 
netometers  whose  high  precision  results  from  aver¬ 
aging  over  many  cycles  of  the  Josephson 
oscillations. 

In  collaboration  with  the  Dutch  group  headed  by 
Professor  Horst  Rogalla,  which  is  the  leading  group 
in  fabricating  high-quality  high  Tc  Josephson  junc¬ 
tions,  we  have  studied  the  response  of  a 


YBCQ/PBCOA'BCO  ramp-type  junction  to  coherent 
radiation  at  176  GHz  and  270  GHz.  The  l-V  charac¬ 
teristic  of  the  junction  closely  resembles  the  predic¬ 
tion  of  the  resistively-shunted-junction  (RSJ)  model. 
The  critical  current  and  normal  resistance  product 
IcRn  of  the  junction  is  0.25  mV  at  5  K.  The 
millimeter-wave  radiation  is  coupled  to  the  junction 
via  a  quasioptical  structure  that  focuses  the  radi¬ 
ation  onto  the  junction  through  a  yttrium-stabilized 
ZrQ2  substrate.  At  176  GHz,  we  have  observed  as 
many  as  six  Shapiro  steps  at  the  maximum  power 
level  of  our  Gunn  Qscillator-pumped  frequency 
doubler.  This  implies  a  Josephson  oscillation  up  to 
1  THz.  Shapiro  steps  are  still  visible  up  to  65  K,  as 
shown  in  figure  8. 

The  amplitudes  of  the  zeroth,  first,  and  second 
Shapiro  steps,  as  functions  of  the  square  root  of  the 
radiation  power,  agree  remarkably  well  with  a 
Bessel  function  fit,  as  shown  in  figure  9.  This 
Bessel-like  behavior  is  an  indication  that  the  junc¬ 
tion  is  voltage-biased  at  the  radiation  frequency. 
This  is  in  great  contrast  to  other  experiments 
carried  at  lower  frequencies,  where  a  current-biased 
behavior  is  usually  observed.  For  a  voltage-biased 
RSJ-like  junction,  the  detector  response  can  be  pre¬ 
dicted  analytically.  The  current  responsivity  of  our 
device,  defined  as  the  induced  dc  current  per  unit 
RF  power,  is  calculated  to  be  2  x  10^  AA/V,  which  is 
comparable  to  the  quantum  efficiency  e/hw 
m  1.4  X 10®  AAV  at  this  frequency. 

We  have  also  measured  the  response  of  the 
high-Tc  Josephson  junction  to  coherent  radiation  at 
270  GHz.  At  this  frequency,  due  to  a  combination  of 
the  heavy  RF  loss  in  the  ZrOz  substrate  and  the 
lack  of  radiation  power,  we  have  observed  only  the 
first  Shapiro  step. 


Figure  8.  l-V  curves  of  the  high-Tc  Josephson  junction 
irradiated  by  coherent  radiation  at  176  GHz  and  taken  at 
different  temperatures.  Shapiro  steps  are  still  visible  at  65 
K. 
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Figure  9.  Measured  amplitudes  (normalized  by  the  crit¬ 
ical  current  in  the  absence  radiation)  of  the  zeroth,  first, 
and  second  Shapiro  steps  as  functions  of  the  square  root 
of  radiation  power  at  176  GHz  near  5  K.  The  solid  lines 
are  Bessel  functions. 


3.5  Far-infrared  (THz)  Lasers  Using 
Muitipie  Quantum  Wells 
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Semiconductor  quantum  wells  are  man-made 
quantum  mechanical  systems  in  which  the  energy 
levels  can  be  chosen  by  changing  the  sizes  of  the 
quantum  wells.  Typically,  the  frequency  corre¬ 


sponding  to  the  intersubband  transitions  is  in  the 
far-infrared  or  THz  range.  Naturally,  long- 
wavelength  photoelectric  devices,  such  as  far- 
infrared  lasers,  which  utilize  the  intersubband 
transitions  have  been  proposed  and  subsequently 
studied. 

Significant  progress  has  been  made  recently  toward 
this  goal.  Large  oscillator  strengths  of  intersubband 
transitions  have  been  observed  in  far-infrared 
absorption  spectroscopy  experiments.  An  intersub¬ 
band  spontaneous  emission  with  a  power  level  of 
-  10-^W  has  been  observed.  Although  these  prelim¬ 
inary  results  are  encouraging,  two  major  challenges 
to  building  a  quantum-well  far-infrared  laser  still 
remain.  One  is  to  achieve  a  high  degree  of  popu¬ 
lation  inversion  between  two  subband  levels,  and 
the  other  is  the  confinement  of  photons  within  the 
active  region.  Optical  pumping  suffers  from  low 
efficiency  and  low  modulation  speed.  Thus,  it  will 
be  difficult  to  achieve  a  sufficient  gain  to  maintain  a 
lasing  oscillation,  and,  therefore,  unsuitable  for 
communication  applications.  As  far  as  optical  con¬ 
finement  is  concerned,  the  usual  dielectric  wave¬ 
guide  confinement  method,  commonly  used  in 
optical  and  near-infrared  laser  systems,  is  not  appli¬ 
cable  at  far-infrared  frequencies  because  the  con¬ 
finement  (on  the  order  of  a  wavelength)  is  too  large 
compared  to  the  dimensions  of  the  active  region. 

In  this  project,  we  will  use  a  novel  multiple 
quantum-well  (MOW)  device  to  circumvent  these 
two  problems.  In  this  device,  whose  energy  band 
diagram  is  shown  in  figure  10  with  and  without  a  dc 
bias  voltage,  electrons  are  injected  selectively  into 
an  upper  subband  level  of  a  wide  lasing  quantum 
well  through  a  narrow  filter  quantum  well.  After 
relaxing  into  a  lower  subband  in  the  lasing  well,  the 
electrons  are  then  removed  selectively  to  a  collector 
through  another  narrow  filter  quantum  well.  This 
filter  well  prevents  the  electrons  in  the  upper 
subband  level  from  tunneling  out  to  the  collector. 
Using  this  method  of  selective  injection  and 
removal,  a  high  degree  of  population  inversion  can 
be  achieved,  provided  the  tunneling  rate  of  the 
electrons  from  the  ground  state  to  the  collector  is 
greater  than  the  relaxation  rate  of  the  electrons 
from  the  upper  subband  to  the  ground  state.  Our 
calculation  indicates  that  this  should  be  achieved 
easily. 


114  RLE  Progress  Report  Number  1 35 


Chapter  3.  High-Frequency  Electronic  Devices 


Figure  10.  Band  profiles  (not  drawn  in  proportion)  of  the 
proposed  far-infrared  laser  device  under  (a)  zero-bias, 
and  (b)  lasing  condition  in  which  Veu, 
-  (Ej  -  Et)/e  E  -  20  mV.  The  device  consists  of  one 
narrow  quantum-well  energy  filter  on  both  sides  of  the 
lasing  quantum  well.  The  width  of  the  narrow  filter  wells 
(made  of  lnos3Gao47As)  is  approximately  230  A  so  the 
energy  of  its  first  subband  lies  in  between  the  first  two 
subbands  in  the  wide  lasing  well  whose  width  is  approxi¬ 
mately  360  A,  and  will  have  a  radiation  frequency  of  5 
THz  due  to  the  intersubband  transitions. 

In  the  proposed  MQW  laser  device,  a  parallel-plate 
waveguide  that  confines  the  photons  can  be  formed 
between  a  heavily  doped  semiconductor  injector 
and  collector.  At  a  doping  level  of  2  x  lO’^/cm^,  we 
can  raise  the  plasma  frequency  in  the  injector  and 
collector  to  about  17  THz.  It  has  been  reported  that 
the  electron  mobility,  p,  for  such  a  high  doping  level 
is  5  X  lO^cmWs.  This  plasma  is  mostly  reflective 
with  a  penetration  depth  as  short  as  0.8  pm  for  a 
lasing  frequency  in  the  range  of  1  -8  THz.  This  con¬ 
finement  is  sufficiently  tight  so  that  the  lasing 
threshold  current  density  is  on  the  order  of  several 
hundred  A/cm^,  which  should  be  achievable. 

The  first  step  of  this  project  is  to  fabricate  multiple- 
quantum-well  structures  with  different  well  widths 
and  barrier  thicknesses  and  to  perform  tunneling 
spectroscopy  on  them  to  determine  the  relative 
positions  of  the  subbands  in  different  quantum 
wells.  Because  of  the  small  energy  differences 
which  are  associated  with  the  relatively  wide 
quantum  wells,  the  tunneling  spectroscopy  must  be 
performed  at  liquid  helium  temperatures.  Our  initial 


measurements  on  the  tunneling  l-V  characteristics 
revealed  many  features  which  could  be  attributed  to 
elastic  tunneling  and  LO  phonon-assisted  inelastic 
tunneling. 

To  interpret  the  fine  features  of  the  l-V  character¬ 
istics  with  a  greater  certainty,  we  have  employed  a 
powerful  method,  magnetotunneling  spectroscopy, 
to  study  the  relative  positions  of  the  subbands.  In 
addition  to  spectroscopy,  this  study  has  a  practical 
purpose  for  laser  applications.  The  nonradiative 
intersubband  relaxation  rate  due  to  phonon  emis¬ 
sions  is  much  greater  than  that  of  the  radiative 
relaxation  rate.  This  is  mainly  because  the  trans¬ 
verse  motion  of  electrons  in  the  lasing  quantum  well 
is  unrestricted,  so  there  is  a  large  volume  in  the 
phase  space  into  which  the  electrons  can  be  scat¬ 
tered.  Similar  to  the  concept  employed  in  a 
tokamak  to  confine  plasmas,  a  strong  magnetic  field 
will  restrict  the  electron  motions  in  the  transverse 
directions.  This  magnetic  field-induced  transverse 
confinement,  along  with  the  longitudinal  confine¬ 
ment  provided  by  the  quantum  well,  will  effectively 
reduce  the  dimensionality  of  the  electron  system  to 
that  of  a  zero-dimensional  system,  exactly  like  that 
of  quantum  dot  systems.  Therefore,  the 
nonradiative  relaxation  rate  can  be  substantially 
reduced.  Because  the  magnetic  field  does  not 
affect  the  longitudinal  motion  of  electrons,  electrical 
pumping  is  still  possible.  This  is  difficult  to  imple¬ 
ment  for  quantum  dot  systems  which  are  formed 
geometrically. 

We  have  performed  magnetotunneling  spectroscopy 
on  a  double  quantum-well  structure  which  consists 
of  115-A  and  200-A  lno53Gao47As  quantum  wells, 
both  are  confined  by  30  A  of  InoszAlo^As  respec¬ 
tively  on  the  top  and  bottom  and  separated  by  a 
60-A  lno52Alo48As  middle  barrier.  When  a  longi¬ 
tudinal  magnetic  field  is  applied,  the  tunneling 
current  is  modulated  by  the  field,  as  shown  in  figure 
11(a).  The  magnetooscillation  is  periodic  with  the 
inverse  of  the  magnetic  field  1/B.  Thus,  a  Fourier 
transform  of  this  magnetooscillation  in  1/B  gives  a 
peak  which  centers  at  a  characteristic  field  Bq. 

The  physical  origin  of  this  magnetooscillation  is 
scattering-mediated  inter-Landau  level  tunneling. 
Suppose  the  energy  in  the  first  well  is  given  by 
El  ■¥  (n,  -h  1/2)AeB/m*,  and  the  energy  in  the 
second  well  is  Ej  -i-  (n2  +  1/2)ftB/m*,  where 
El  and  E2  are  the  energies  of  the  subbands  in  the 
first  and  second  well  under  concern.  In  the  absence 
of  LO  phonon  emissions,  energy  conservation 
requires  that  the  two  energies  are  equal.  Thus, 
(n2  -  n,)/fteB/m*  =  El  -  E2  or  1/B  =  (02  -  n,) 
Ae/m*(E,  -  E2).  This  implies  that  whenever  1/B  is  an 
integer  times  the  inverse  of  a  characteristic  field 
1/Bo  =  fie/m*(Ei  -  E2),  the  energy  conservation  is 
satisfied  and  the  tunneling  current  will  be  enhanced. 
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which  gives  the  oscillatory  behavior  of  the  tunneling 
current  as  a  function  of  1/B,  as  shown  in  figure 
11(a). 

This  magnetooscillation  provides  us  with  very 
important  information  about  the  relative  positions  of 
the  subbands  under  concern.  The  energy  difference 
between  the  two  subbands  is  related  to  Bo  through 
E,  -  E2  =  fieBo/m*.  Thus,  the  energy  difference  can 
be  measured  directly  using  the  magnetotunneling 
spectroscopy.  Since  the  energy  difference  E,  -  E2 
changes  linearly  with  the  bias  voltage,  so  is  the 
characteristic  field  Bg.  By  measuring  Bo  as  a  func¬ 
tion  of  the  bias  voltage,  we  can  determine  the  frac¬ 
tion  of  the  total  applied  bias  voltage  dropped  across 
the  two  wells. 


Fundamental  Frequency  B,  [T] 


Figure  11.  (a)  Tunneling  current  (at  a  fixed  bias  voltage) 
as  a  function  of  the  strength  of  a  magnetic  field  applied  in 
the  same  direction  as  that  of  the  tunneling  current,  (b) 
Fourier  transform  of  the  tunneling  current  as  a  function  of 
1/B. 
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1.1  Introduction 
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Our  objectives  are  to  develop,  using  renormaliza¬ 
tion-group  theory  and  other  methods  of  statistical 
mechanics,  microscopic  theories  of  quantum  elec¬ 
tronic  systems  and  of  semiconductor  alloy  systems 
that  properly  account  for  correlation  effects.  Our 
approach  is  particularly  suited  to  systems  with  fluc¬ 
tuations  due  to  finite  temperatures,  impurities,  sur¬ 
faces  or  other  geometric  constraints. 


*  1.2  Renormalization-Group  Approach  to 
Eiectronic  Systems 

Project  Staff 

Alexis  Falicov,  Professor  A.  Nihat  Berker 

®  High-Tc  superconductivity,  metallic  magnetism,  the 

metal-insulator  transition,  and  heavy  fermion 
behavior  are  all  phenomena  produced  by  the  strong 
correlations  of  electrons  in  narrow  energy  bands.  It 
is  therefore  important  to  study  theoretical  models 
that  incorporate  the  strong  correlation  effects  of 

•  electrons.  The  tJ  model  is  such  a  system.  It  is 
defined  on  a  lattice  with  one  spherically  symmetric 
orbital  per  site  by  the  Hamiltonian 


P[-t  Z  (c^  Cj^,  +  h.c.) 
<  ij  > ,  o 


where  P  is  an  operator  that  projects  out  all  doubly 
occupied  sites,  c,^„  and  Ck,  are  creation  and  annihi¬ 
lation  operators  for  an  electron  in  a  Wannier  state 
at  site  i  with  z-component  of  spin  o,  ni  is  the  elec¬ 
tron  number  operator  that  counts  the  total  number 
of  electrons  at  site  i,  and  Si  is  the  spin  operator. 
This  Hamiltonian  can  be  interpreted  in  two  ways: 

(1)  In  the  case  of  small  J,  the  system  can  be 
thought  of  as  the  large  U  limit  of  the  single-band 
Hubbard  model  of  electronic  systems.  The  antifer¬ 
romagnetic  exchange  comes  as  a  result  of  a  virtual 
process  where  one  electron  hops  onto  a  singly 
occupied  nearest-neighbor  site  and  then  hops  back. 
The  energy  gain  for  such  a  process  is  of  the  order 
of  tVU  since  a  doubly  occupied  site  has  energy  U. 

(2)  The  system  can  also  be  thought  pf  as  an  elec¬ 
tronic  system  with  a  "super”  exclusion  principle 
where  no  two  electrons  (like  or  unlike  spins)  are 
allowed  on  the  same  site.  The  second  interpreta¬ 
tion  does  not  put  any  restriction  on  the  size  of  J. 

There  are  few  rigorous  results  available  on  this 
system:  (1)  At  half  filling,  the  system  reduces  to  a 
Heisenberg  antiferromagnet;  (2)  At  2t-±J,  the 
model  has  been  solved  by  the  Bethe-ansatz  tech¬ 
nique;  and  (3)  In  one  dimension  and  J-0,  the 
model  has  been  solved  also  by  the  Bethe-ansatz 
technique.  Other  attempts  to  study  this  model  have 
focused  on  the  ground  state  and  the  first  few 
excited  states.  We  have  decided,  instead,  to  focus 
on  the  thermodynamic  properties  of  this  system. 
Our  method,  the  renormalization-group  approach, 
involves  solving  a  statistical  mechanics  problem  by 
a  recursive  elimination  of  the  degrees  of  freedom. 
Since  the  Hamiltonian  involves  a  regular  lattice,  this 
problem  is  well  suited  for  the  position-space  renor¬ 
malization-group  method.  The  solution  is  obtained 
in  an  expanded  space  with  Hamiltonian 


+  J 


Z  (Si-Sj 
<ij> 


nin/4)]  P, 
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Jf'  - 


P[-t  2  {4cj„  +  h.c.) 

<  ij  > ,  o 


+  2  (JSj  Sj  +  Vnjrij)  +  [i2jnj]  P. 

<ij>  ' 

Because  of  the  non-commutativity  of  quantum  oper¬ 
ators,  we  had  to  develop  new  techniques  of  renor¬ 
malization-group  theory. 

We  have  obtained  the  renormalization-group  flows 
for  the  effective  coupling  constants  for  one-,  two-, 
and  three-dimensional  systems.  The  flows  deter¬ 
mine  the  phase  diagrams  and  all  thermodynamic 
properties.  In  one  dimension,  we  find,  as  expected, 
no  finite-temperature  phase  transition.  In  two 
dimensions,  we  find  a  single  finite-temperature  crit¬ 
ical  point,  as  previous  researchers  conjectured  but 
were  unable  to  derive. 

In  three  dimensions,  we  find  a  global  finite- 
temperature  phase  diagram  that  is  richly  structured. 
Disordered,  antiferromagnetic,  and  conducting 
phases  are  separated  by  first-  and  second-order 
phase  boundaries,  which  are  themselves  delineated 
by  lines  of  tricritical  points,  critical  points,  and  crit¬ 
ical  endpoints.  Complicated  structures  with  multiple 
reentrances  occur.  We  have  also  calculated,  using 
our  renormalization-group  transformation,  electronic 
densities  and  nearest-neighbor  correlations  as  a 
function  of  electronic  chemical  potential.  These 
yield  a  microscopic  understanding  of  the  different 
phases.  We  have  also  completed  small-cluster  cal¬ 
culations  which  cannot  yield  phase  transitions,  but 
which  support  the  densities  and  correlations  of  the 
renormalization-group  theory. 

1.3  Suzuki-Trotter  Approach  to 
Electronic  Systems 
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It  is  possible  to  rigorously  map  d-dimensional 
quantum  mechanical  spin  systems  onto 
(d+1)-dimensional  classical  systems  with  more 
complicated  interactions.  This  method,  introduced 
by  Suzuki,  uses  the  Trotter  formula  and  is  the 
analog  for  statistical  mechanics  of  the  Feynman- 
path  approach  to  particle  propagators. 

We  are  investigating  the  possibility  of  adapting  this 
approach  to  electronic  systems  where,  in  more  than 


one  dimension,  fermion  statistics  complicates  the 
classical  problem.  By  numerical  simulation,  the 
resulting  classical  models  will  be  used  to  confirm 
the  renormalization-group  results  mentioned  above. 


1.4  Phase  Diagrams  of  Semiconductor 
Alloys 
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We  are  conducting  a  study  of  ternary  compounds 
on  face-centered-cubic  lattices.  Our  aim  is  to  eluci¬ 
date  the  phase  behavior  of  ternary  and  quaternary 
semiconductor  alloys.  These  alloys  could  exist  in 
the  zincblende,  chalcopyrite,  or  possibly  stannite 
structures,  involving  two  interpenetrating  fee  lattices 
on  which  up  to  four  atomic  species  exist.  One 
atomic  species  occupies  one  of  the  fee  lattices 
while  three  other  atomic  species  may  (chalcopyrite, 
stannite)  or  may  not  (zincblende)  order  on  the  other 
fee  lattice.  At  present,  the  chalcopyrite  structure  is 
seen  experimentally  in  the  lll-V  compounds.  The 
ranges  of  stability  of  these  structures  are  of  interest. 

In  previous  work  by  K.E.  Newman  and 
collaborators,  the  Blume-Emery-Griffiths  model  has 
been  adopted  for  the  study  of  the  zincblende  to 
chalcopyrite  or  stannite  transitions.  This  model  is  a 
spin-1  Ising  model  with  Hamiltonian 

jr.J  2  S|Sj  +  K  2  spsf 
<  ij  >  <  ij  > 

-  A2sf,  Sj-0,±  1. 
i 

The  three  spin  values  are  each  associated  with  a 
different  species  of  atom.  A,  B,  or  C,  which  exist  on 
one  of  the  fee  lattices.  The  other  fee  lattice  is  con¬ 
sidered  occupied  by  atomic  species  D.  The 
systems  under  consideration  have  the  composition 
[(AB)i-,C2,]D2.  The  model  includes  interactions 
between  the  A,  B,  and  C  atoms.  The  parameters  J 
and  K  in  the  Hamiltonian  above  are  fixed  as  combi¬ 
nations  of  these  interaction  energies.  They  are 
chosen  to  give  the  chalcopyrite  structure  at  low 
temperature  (J  <  0)  and  to  control  the  phase  transi¬ 
tion  between  the  chalcopyrite  and  the  zincblende. 
A  controls  the  relative  densities  of  the  species  (AB) 
and  C. 

These  systems  contain  competing,  frustrated  inter¬ 
actions,  which  necessitate  a  renormalization-group 
treatment.  We  have  been  able  to  develop  a  renor¬ 
malization-group  transformation  that  yields  both  the 
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ferromagnetic  and,  in  contrast  with  previous  work, 
the  (fully  frustrated)  antiferromagnetic  phase  transi¬ 
tions  on  the  fee  lattice  in  the  spin-1/2  limit  obtained 
by  taking  A  to  negative  infinity.  We  are  now  pur¬ 
suing  the  global  phase  diagram  in  thermodynamic 
field  (interaction  constant)  space,  which  will  be  fol¬ 
lowed  by  a  study  of  density  space. 


1.5  Renormalization-Group  Theory  of 
an  Internal  Critical  Endpoint  Structure 
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A  prefaced  renormalization-group  study  has  been 
completed,  indicating  that  all  three  new  features  of 
the  Blume-Emery-Griffiths  model  with  repulsive 
biquadratic  interaction  survive  fluctuations  in  three 
dimensions  but  not  in  two  dimensions.  These  fea¬ 
tures  are:  (1)  a  critical  endpoint  structure  occurring 
inside  the  ferromagnetic  phase;  (2)  disordered- 
ferromagnetic-disordered  reentrance;  and  (3)  a 
ferrimagnetic  phase  sandwiched  between  the  ferro¬ 
magnetic  and  antiquadrupolar  phases.  The  renor¬ 
malization-group  mechanism  for  a  critical  endpoint 
structure  inside  the  ordered  phase  is  shown  to  be 
the  same  as  that  originally  found  for  a  critical 
endpoint  structure  inside  the  disordered  phase, 
namely  a  distinct  hybrid  fixed  point,  contrary  to  a 
previous  claim.  This  critical-endpoint  structure  may 
have  been  seen  experimentally  in  the  analogous 
alloy  system  of  FeBra 

1.6  Phase  Diagrams  of  the  Ising  Model 
on  the  Square  Lattice  with  Crossed 
Diagonal  Bonds 
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The  global  phase  diagram  of  the  spin-1 /2  Ising 
model  with  nearest-  and  next-nearest-neighbor 
interactions  on  the  square  lattice  has  been  consid¬ 
ered,  including  the  fully  anisotropic  cases.  A 
closed-form  expression  has  been  deduced  that 
accurately  represents  the  phase  boundaries  when 
the  crossed  next-nearest-neighbor  interactions  are 
not  of  opposite  signs.  The  phase  diagram  includes 
disordered,  ferromagnetic,  antiferromagnetic,  axially 
single-striped,  and  diagonally  double-striped 
phases. 
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2.1  Introduction 
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In  this  research  program  we  use  modern  x-ray  scat¬ 
tering  techniques  to  study  structures  and  phase 
transitions  in  thin  films  and  on  surfaces.  We  have 
two  principal  experimental  facilities,  one  at  MIT  and 
the  other  at  the  National  Synchrotron  Light  Source 
at  Brookhaven  National  Laboratory.  At  MIT,  we 
have  four  high-resolution  computer-controlled  x-ray 
spectrometers.  The  angular  resolution  can  be 
made  as  fine  as  1 .8  seconds  of  arc,  which  enables 
us  to  probe  the  development  of  order  from  dis¬ 
tances  of  the  order  of  the  x-ray  wavelength  -  lA, 
up  to  30,000  A.  The  sample  temperature  can  be 
varied  between  2  K  and  500  K  with  a  relative  accu¬ 
racy  of  2x10-^  K.  At  the  National  Synchrotron 
Light  Source,  in  collaboration  with  IBM,  we  have 
three  fully  instrumented  beam  lines.  1  wo  of  these 
beam  lines  allow  us  to  make  studies  with  photons 
varying  in  energy  between  3  and  12  keV;  the  third 
has  a  fixed  energy  of  17  keV.  These  facilities  make 
possible  high-resolution  scattering  experiments  with 
a  flux  of  more  than  three  orders  of  magnitude  larger 
than  that  of  a  rotating  anode  x-ray  generator. 

Several  years  ago,  as  part  of  this  JSEP  program, 
we  built  an  x-ray-compatible,  high-vacuum  single 
crystal  apparatus.  This  enabled  us  to  use 
synchrotron  radiation  to  study  the  structures  and 
transitions  occurring  at  a  single  surface,  and  such 
experiments  are  now  becoming  routine.  In  collab¬ 
oration  with  Professor  Simon  G.J.  Mochrie,  we  have 
now  designed,  built  and  commissioned  a  second- 
generation  x-ray  surface  facility  which  is  extremely 
versatile  allowing  a  wide  range  of  surface  scattering 
experiments. 

Our  basic  scientific  objective  is  to  understand  the 
morphologies  and  microscopic  structures  of  simple 
semic':nductor  and  metal  surfaces  at  high  temper¬ 


ature.  Possible  phase  changes  include  surface 
roughening,  surface  reconstruction,  melting, 
amorphization  and  dilution.  These  phenomena  are 
particularly  interesting  on  stepped  surfaces,  where 
there  may  be  an  interplay  between  step  structures, 
faceting,  reconstruction,  and  roughening. 


2.2  Metal  Surface  Studies 

Step  interactions  play  a  central  role  in  determining 
the  equilibrium  morphology  of  a  crystalline  surface. 
Different  interactions  are  known  to  result  in  such 
diverse  phenomena  as  step  pairing,  surface 
faceting,  and  surface  roughening.  A  small  density 
of  steps  may  be  incorporated  onto  a  metal  surface 
by  deliberately  cutting  the  crystal  at  an  angle 
slightly  off  from  a  high  symmetry  direction.  If  the 
high  symmetry  surface  does  not  reconstruct,  the 
theory  of  equilibrium  crystal  shapes  (ECS)  predicts 
that  at  low  temperatures  the  steps  should  either 
cluster  together  or  spread  uniformly  across  the 
surface.  In  the  former  case,  a  "hill  and  valley" 
structure  of  densely  stepped  rough  regions  coex¬ 
isting  with  flat,  high  symmetry  facets  will  result. 
The  fraction  of  the  surface  comprised  of  flat  facets 
should  decrease  with  increasing  temperature,  falling 
to  zero  at  some  temperature  below  the  roughening 
transition  temperature  of  the  facet.  In  the  latter 
case,  the  surface  is  predicted  to  be  rough  at  all 
temperatures.  That  is,  entropically  wandering  steps 
should  result  in  a  logarithmic  divergence  of  the 
mean  square  height  fluctuations  as  a  function  of 
lateral  distance  across  the  surface.  X-ray  scat¬ 
tering,  low  energy  electron  diffraction  and  atom 
scattering  studies  have  reported  surface  roughening 
of  numerous  metal  crystalline  facets,  including 
Cu(110),  Ag(110),  Cu{113)  and  Ni(113).  In  addi¬ 
tion,  studies  of  vicinally  miscut  Cu(110)  and 
Ag(110)  have  reported  low  temperature  surface 
phase  separation  into  a  hill  and  valley  structure. 

The  situation  for  Ag{110)  is,  however,  controversial. 
In  1987  we  carried  out  a  high  resolution  x-ray  scat- 
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tering  study  of  a  vicinal  Ag(110)  surface  using  a 
crystal  which  had  been  inadvertently  miscut  by 
-0.5°:  we  interpreted  the  data  on  the  basis  of  a 
roughening  transition  of  the  flat  (110)  facets  at  - 
725  K.  More  recently,  Robinson  and  coworkers 
have  re-addressed  this  problem  using  a  Ag(i10) 
crystal  miscut  by  ~  0.2°.  They  assert  that,  in  equi¬ 
librium,  their  0.2°  vicinal  Ag(110)  surface  phase 
separates  into  flat  (110)  facets  together  with 
stepped  regions  for  temperatures  below  790  K  and 
that  a  uniform  step  structure  is  only  stable  above 
that  temperature.  In  order  to  resolve  this  contro¬ 
versy  and  to  understand  generally  the  behavior  of 
stepped  Ag(110)  surfaces,  we  have  carried  out 
x-ray  scattering  and  scanning  tunneling  microscopy 
(STM)  studies  of  a  Ag(110)  sample  miscut  1°  from 
the  (110)  axis  along  the  (110)  direction.  The  STM 
measurements  have  been  performed  at  IBM  by 
R.M.  Feenstra  and  G.A.  Held. 

For  a  clean  and  well  annealed  surface,  we  find  that 
at  all  temperatures  above  300  K,  the  surface  is 
comprised  of  a  uniform  density  of  single  atomic 
height  meandering  steps.  That  is,  we  find  no  evi¬ 
dence  for  vicinal  phase  separation.  However,  as 
the  surface  is  annealed  to  sufficiently  high  temper¬ 
atures,  we  find  that  impurities  are  driven  to  the 
surface  and  that  this,  in  turn,  results  in  the  forma¬ 
tion  of  large,  flat  (110)  terraces.  The  latter  is  the 
behavior  reported  by  Robinson  and  coworkers 
although  they  interpreted  their  data  assuming  a 
clean  surface.  For  the  clean  surface  we  observe 
diffraction  characteristic  of  uniform,  albeit 
meandering,  steps  at  all  temperatures  between  300 
K  and  875  K.  With  increasing  temperature,  the 
step  peaks  lose  intensity  and  undergo  an  evolution 
in  lineshape  consistent  with  a  decrease  in  step-step 
correlations,  most  noticeably  in  the  (001)  direction. 
While  our  data  above  670  K  are  consistent  with  a 
rough  surface,  the  peak  evolution  is  not  consistent 
in  detail  with  the  simplest  model  of  entropically 
repulsive  steps. 


2.3  Semiconductor  Surface  Studies 

As  discussed  above,  vicinal  surfaces,  that  is,  sur¬ 
faces  of  crystals  cut  close  to  a  high  symmetry  direc¬ 
tion,  exhibit  remarkably  rich  and  diverse  physical 
behavior.  At  equilibrium,  such  surfaces  may  exhibit 
roughening,  faceting,  and/or  reconstruction  transi¬ 
tions.  Novel  non-equilibrium  phenomena  also  may 
occur.  One  of  t  i  most  interesting  and  well-studied 
systems  is  vicinal  Si(111).  At  temperatures  imme¬ 
diately  above  the  1  X  1  to  7  X  7  reconstruction  tran¬ 
sition  at  1100  K,  this  surface  can  be  described  by 
flat  (111)  terraces  separated  by  uniformly  spaced 
single  Si(111)  bilayer  steps.  In  recent  synchrotron 
x-ray  experiments  we  demonstrated  that  the  "uni¬ 


formly"  stepped  surface  is  in  fact  rough  with  loga¬ 
rithmically  divergent  height  fluctuations.  Below 
1 1 00  K  the  1  X  1  -» 7  X  7  transition  drives  a  faceting 
transition  in  which  the  steps  phase-separate 
yielding  flat,  reconstructed  (111)  terraces  and  well- 
ordered  but  higher-angle  stepped  regions. 

We  have  determined  the  temperature  dependence 
of  the  surface  roughness  above  1100  K,  a  regime 
where  significant  numbers  of  atoms  sublimate. 
Remarkably,  the  surface  height  fluctuations  exhibit 
a  logarithmic  divergence  up  to  1275  K  even  under 
significant  sublimation.  This  is  best  repre.'  ^nted  by 
a  power-law  line  shape  diffraction  profile  o  the  step 
lattice  characterized  by  the  exponent  q.  r.^presen- 
tative  power  law  diffraction  profiles  are  snown  in 
figure  1.  At  -  1275  K,  the  surface  undergoes  a 
sharp  morphological  transition  to  one  characichzed 
by  large  uniform  (ill)  terraces  plus  highly  disor¬ 
dered  stepped  regions.  This  faceting  was  appar¬ 
ently  observed  previously,  although  on  a  Si(111) 
surface  with  a  very  small  miscut  (8’)  compared  with 
a  3°  miscut  on  our  Si(1 1 1 )  sample. 

Our  x-ray  experiment  reveals  that  the  faceting  is 
accompanied  by  a  step  disordering  transition  in 
which  the  step  correlations  change  drastically. 
Above  the  disordering  transition  the  height  fluctu¬ 
ations  of  the  surface  diverge  much  faster  than  the 
thermal  logarithmic  divergence.  Data  on  the 
stepped  region  diffraction  peak  position  and  width 
are  shown  in  figure  2.  We  speculate  that  the  "tran¬ 
sition"  is  a  non-equilibrium  phenomenon  driven  by  a 
combination  of  sublimation  and  electromigration. 
Our  results  are  consistent  with  theoretical  predic¬ 
tions  for  growing  stepped  surfaces.  The  discovery 
of  this  new  transition  on  thoroughly  studied  and 
structurally  perfect  silicon  surfaces  should  cast  light 
on  the  general  problems  of  surface  morphology  at 
high  temperatures. 


2.4  Model  Surface  Overlayers 

Much  of  the  basic  physics  of  surface  systems, 
including  both  clean  surfaces  and  surfaces  with 
physisorbed  or  chemisorbed  overlayers,  is  deter¬ 
mined  by  general  structural  and  symmetry  consider¬ 
ations.  For  this  reason,  experiments  on  model 
systems  such  as  rare  gas  overlayers  on  graphite 
can  provide  important  insights  with  implications  for 
all  surface  systems.  An  important  illlustration  of  this 
is  the  chiral  3-states  Potts  model  description  of  the 
melting  of  the  1  x  3  reconstruction  of  Si(113)  as 
described  by  Simon  G.J.  Mochrie  in  this  report. 
The  theory  for  this  semiconductor  surface  transition 
was  ominaIN  developed  to  describe  the  melting  of 
the  y3  x^3  structure  of  monolayer  krypton  on 
graphite. 
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Figure  1.  Longitudinal  scans  of  the  primary  step 
superlattice  peak  for  a  3°  miscut  Si(111)  crystal  below 
1270  K  with  background  subtracted.  The  scans  are 
plotted  relative  to  the  integer  order  peak  position  of  the 
(111)  facets.  The  solid  curves  are  the  results  of  fits  to  a 
power  law  singularity  form  for  the  step  correlations. 
There  is  no  change  in  the  lineshape  in  this  temperature 
range  other  than  sharpening  due  to  annealing. 

Recently,  we  have  carried  out  a  series  of  exper¬ 
iments  on  xenon  overlayers  physisorbed  onto  single 
crystal  graphite.  These  experiments  have  focused 
on  two  issues:  (1)  the  evolution  of  the  structure  as 
a  function  of  temperature  for  one,  two  ’hree  and  six 
layers;  and  (2)  the  nature  of  the  melting  transition 
for  coverages  in  the  monolayer  region.  Results  of 
both  sets  of  experiments  are  still  being  analyzed. 

The  essential  features  of  the  multilayer  studies  may 
be  summarized  as  follows.  Monolayer  xenon  exhi¬ 
bits  a  commensurate-incommensurate  transition  at 
about  65  K.  Bilayer  xenon  exhibits  more  elaborate 
behavior.  For  temperatures  above  60  K,  the  two 
layers  have  the  same  lattice  constant  with  a  value 
incommensurate  with  respect  to  the  substrate.  At  ~ 
60  K  the  layer  of  xenon  closest  to  the  graphite 
locks  into  registry  with  the  substrate  while  the 
second  layer  remains  incommensurate.  On  the 
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Figure  2.  Longitudinal  and  transverse  scans  of  the  step 
superlattice  peak  across  the  disordering  transition  near 
1275  K.  The  solid  curves  are  the  results  of  fits  of  the 
step  peak  to  an  anisotropic  displaced  Lorentzian.  Note 
the  drastic  change  in  the  width  of  the  longitudinal  scans. 

Other  hand,  the  three-  and  six-layer  xenon  systems 
remain  uniform  and  incommensurate  at  all  temper¬ 
atures. 

Monolayer  xenon  on  graphite  has  been  viewed  as  a 
model  system  for  studying  melting  in  two  dimen¬ 
sions.  There  is,  however,  disagreement  in  the 
interpretation  of  thermodynamic  and  scattering  data 
on  the  order  of  the  melting  transition  at  various  cov¬ 
erages.  Specifically,  scattering  studies  suggest  a 
continuous  melting  process  whereas  themodynamic 
data  have  been  interpreted  on  the  basis  of  first 
order  melting  transition  for  coverages  near  but 
below  one  monolayer.  We  have  studied  the  melting 
as  a  function  of  pressure  and  hence  chemical 
potential  at  fixed  temperatures  of  121  K,  140  K,  and 
146  K.  At  the  two  higher  temperatures,  we  find  that 
the  solid  diffraction  peak  decreases  progressively  in 
intensity  with  decreasing  pressure  evolving  contin¬ 
uously  into  the  diffraction  pattern  characteristic  of  a 
strongly  correlated  fluid  with  well-developed 
orientational  order.  This  necessitates  that  the 
melting  transition  is  at  least  nearly  continuous.  We 
are  currently  carrying  out  a  detailed  line-shape 
analysis  of  the  diffraction  profiles  to  obtain  quantita¬ 
tive  information  on  the  melting  transition. 
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3.1  A  New  Mechanism  for  Dissociative 
Chemisorption  on  Si:  Atom  Abstraction 
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F2  is  known  to  adsorb  on  Si(IOO)  at  300  K  with 
close  to  unit  probability  in  the  zero  coverage  limit. 
We  have  showi  that  the  adsorption  mechanism 
involves  both  the  common  mechanism  for 
dissociative  adsorption,  whereby  both  fluorine 
atoms  of  the  molecule  adsorb,  and  a  new  mech¬ 
anism,  fluorine  atom  abstraction.  In  the  abstraction 
mechanism,  the  Si  surface  captures  one  of  the 
fluorine  atoms  from  the  incident  F2  molecule,  while 
the  other  F  atom  scatters  back  into  the  gas  phase. 

This  observation  was  made  in  a  UHV  molecular 
beam  surface  scattering  apparatus  by  monitoring 
the  directly  scattered  F  atom  signal  with  a  triply 
differentially  pumped  quadrupole  mass 
spectrometer.  The  scattered  F  atom  signal  is 
readily  distinguishable  from  the  F*  signal  due  to 
cracking  of  the  unreacted  F2  in  the  electron 
bombardment  ionizer  by  its  time  of  flight  spectrum. 
The  directly  scattered  F  atoms  are  significantly 
more  energetic,  indicating  that  some  of  the 
exothermicity  of  this  reaction  is  channeled  into 
translation. 

We  have  also  taken  measurements  of  the  abstrac¬ 
tion  mechanism  as  a  function  of  the  incident  F2 
energy,  incident  angle,  surface  temperature,  and 
fluorine  coverage.  In  addition,  we  have  shown  that 
the  Si  dangling  bonds  are  the  sites  for  fluorine 


adsorption  and  are  probably  responsible  for  the 
abstraction  mechanism.  This  was  accomplished  by 
directing  a  well-characterized,  monoenergetic  beam 
of  He  atoms  at  the  surface  covered  with  a 
monolayer  of  fluorine  and  monitoring  the  angular 
distribution  of  diffracted  atoms.  The  results  of  He 
diffraction  measurement  show  that  the  fluorine  does 
not  break  the  dimer  bond  characteristic  of  (2x1) 
reconstruction.  While  the  binding  of  fluorine  to  the 
dangling  bonds  along  with  the  maintenance  of  the 
surface  Si  dimer  bond  is  not  surprising,  this  study 
represents  the  experimental  evidence  for  these 
adsorption  sites. 

The  reaction  of  F2  with  Si{100)  ceases  after  the 
dangling  bonds  are  saturated  at  one  monolayer  of 
coverage.  This  lack  of  additional  reactivity  pre¬ 
cludes  build  up  of  a  sufficient  layer  of  fluorine  to 
produce  the  volatile  etch  product  SiF«  and  therefore 
the  use  of  Fj  as  an  etchant  of  Si.  However,  we 
have  shown  that  if  the  kinetic  energy  of  the  incident 
F2  molecule  is  increased  above  a  threshold  value  of 
4-5  kcal  mol  ’  (0.2  eV),  the  dissociation  probability 
of  F2  with  a  fluorinated  surface  increases  linearly 
with  the  normal  component  of  the  kinetic  energy. 

This  additional  reactivity  at  higher  translational 
energies,  which  results  in  fluorine  coverages 
greater  than  one  monolayer,  is  the  result  of  sur¬ 
mounting  a  barrier  to  dissociation  on  the  fluorinated 
surface.  Measurements  of  the  He  atom  diffraction 
spectra  as  a  function  of  the  incident  F2  kinetic 
energy  indicate  that  the  barrier  is  associated  with 
dissociative  chemisorption  on  the  second  layer  of  Si 
atoms  as  well  as  on  Si  dimer  atoms.  These  conclu¬ 
sions  are  based  on  the  identical  rate  of  linear  decay 
for  all  diffraction  features  with  increasing  F2  normal 
energy  above  the  threshold.  There  is  no  preferen¬ 
tial  cleavage  of  the  Si  dimer  atoms.  These  obser¬ 
vations  may  represent  the  first  experimental 
correlation  of  the  barrier  for  dissociation  with  the 
sites  for  dissociation. 
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3.2  New  Mechanisms  for  Surface 
Processes 
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We  have  recently  shown  that  hydrogen  can  be 
embedded  in  the  bulk  of  Ni  by  a  new  mechanism, 
collision  induced  absorption.  A  beam  of  monoener- 
getic  Ar,  Kr,  or  Xe  atoms,  produced  by  the  super¬ 
sonic  expansion  of  the  inert  gas  seeded  in  He,  is 
directed  at  a  monolayer  of  atomic  hydrogen 
chemisorbed  on  Ni(111).  We  observed  that 
impacts  of  the  incident  inert  gas  atoms  induce 
absorption  of  chemisorbed  hydrogen.  The  hydrogen 
absorbed  is  identified  by  appearance  of  a 
vibrational  mode  at  800  cm- '  and  appearance  of  a 
feature  at  180-220  K  in  the  thermal  desorption 


spectrum.  This  is  consistent  with  our  previous  study 
of  atomic  hydrogen  absorption  into  this  crystal. 

We  have  extended  this  work  to  include  a  study  of 
the  dynamics  of  collision  induced  absorption.  We 
probe  these  dynamics  by  measuring  the  cross 
section  for  absorption  of  chemisorbed  H  induced  by 
the  impact  of  Ar,  Kr,  or  Xe  atoms  as  a  function  of 
their  incident  energy  and  angle.  The  cross  section 
is  calculated  from  measurements  of  absorption  rate 
in  the  limit  of  zero  absorbance. 

The  absorption  rate  is  measured  in  the  following 
manner.  An  inert  gas  atom  beam  is  directed  at  the 
crystal  covered  with  a  H  monolayer.  A  partial  pres¬ 
sure  of  2x10-®  torr  of  Hj  is  present  in  the  chamber 
during  exposure  to  the  inert  gas  atom  beam.  The 
pressure  of  H2  is  necessary  to  replace  the  H  lost 
from  the  surface  as  the  result  of  absorption  and  a 
competing  process,  collision  induced  recombinative 
desorption.  In  addition,  maintenance  of  a  layer  of 
surface  H  blocks  the  exchange  between  surface 
and  bulk  H  that  is  observed  to  be  also  induced  by 
collisions  of  the  inert  gas  atoms.  At  the  end  of  the 
exposure  to  the  inert  gas  atom  beam,  the  ambient 
H2  is  pumped  away,  and  a  thermal  desorption 
measurement  is  carried  out.  The  amount  of  H 
absorbed  is  determined  by  the  integrated  area  of 
the  low  temperature  feature.  This  measurement  is 
made  for  Ar,  Kr,  and  Xe  atoms  incident  over  a  wide 
range  of  angles  and  total  kinetic  energies. 

The  collision  induced  absorption  cross  section  is 
observed  to  depend  linearly  on  the  normal  compo¬ 
nent  of  the  kinetic  energy  of  the  incident  Ar,  Kr,  and 
Xe  atoms.  Strict  normal  energy  scaling  of  the  cross 
section  is  observed,  and  the  cross  section  for 
absorption  induced  by  Xe  is  significantly  larger  than 
that  for  Kr  or  Ar.  These  two  observations  suggest 
that  the  cross  section  for  collision  induced  absorp¬ 
tion  is  directly  proportional  to  the  energy  transfer  to 
the  Ni  lattice; 

Z(Ej  cpi  m)  «  A  AE(E|  (pj  m) 

where  AE  is  the  energy  transferred  from  the  inert 
gas  atom  to  the  lattice,  E,  is  the  incident  energy,  (p, 
is  the  incident  angle,  m  is  the  mass  of  the  inert  gas 
atom  and  A  is  a  proportionality  constant. 

To  test  whether  the  absorption  cross  section  does 
correlate  with  energy  transfer  to  the  lattice,  the  inci¬ 
dent  energy,  incident  angle,  and  mass  dependence 
of  the  energy  transferred  from  the  inert  gas  atom  to 
the  Ni  lattice  is  calculated  and  compared  to  that  of 
the  experimentally  determined  cross  section.  The 
energy  transfer  is  calculated  within  a  hard  sphere- 
hard  cube  impulsive  collision  model  which  has  been 
modified  to  include  the  multiple  collisions  that  take 
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place  when  a  heavier  gas  atom  is  incident  on  a 
lighter  substrate  atom.  The  mass  of  the  Ni  atom  is 
taken  as  its  atomic  mass.  Motion  of  the  Ni  atom  is 
only  allowed  normal  to  the  macroscopic  plane  of 
the  surface,  which  is  known  to  be  an  excellent 
approximation  in  an  atom-surface  collision.  Once 
the  Ni  atom  has  suffered  its  first  collision  with  the 
inert  gas  atom,  it  is  assumed  to  recoil  instantane¬ 
ously  back  toward  the  incoming  inert  gas  atom. 
However,  the  Ni  atom  recoils  with  only  a  fraction 
(0.8)  of  its  momentum  acquired  by  collision  with  the 
inert  gas  atom.  The  rest  of  the  momentum  is  lost 
to  the  lattice  and  in  this  way  energy  is  dissipated  to 
the  lattice  in  this  calculation.  A  second  collision  of 
the  recoiling  Ni  atom  and  the  incoming  inert  gas 
atom  occurs,  and  their  new  momenta  are  calcu¬ 
lated.  Again,  the  Ni  atom  is  assumed  to  recoil 
instantaneously  with  a  fraction  (0.8)  of  its 
momentum  after  the  second  collision.  A  third  colli¬ 
sion  may  occur,  depending  on  the  mass  of  the 
incoming  inert  gas  atom.  The  total  energy  trans¬ 
ferred  is  calculated  by  summing  the  energy  lost  to 
the  lattice  by  the  recoiling  Ni  atom  after  each  colli¬ 
sion  and  the  final  energy  of  the  Ni  atom. 

The  energy  transferred  to  the  lattice  by  Xe,  Kr,  and 
Ar  is  plotted  as  a  function  of  their  normal  energy  on 
the  same  plot  as  the  experimental  cross  sections. 
The  energy  transferred  by  Xe  is  scaled  to  the 
experimental  cross  section  for  Xe,  allowing  the  pro¬ 
portionality  factor  A  to  be  determined.  The  propor¬ 
tionality  factor  is  then  used  to  predict  the  cross 
sections  for  absorption  by  Ar  and  Kr  based  on  the 
energy  transferred  to  the  lattice  by  Ar  and  Kr.  The 
experimental  dependence  of  the  cross  sections  on 
the  mass  and  normal  energy  agrees  very  well  with 
that  of  the  cross  sections  predicted  on  the  basis  of 
the  amount  of  energy  transferred  to  the  lattice.  The 
one  adjustable  parameter  in  this  model,  the  fraction 
of  momentum  with  which  the  Ni  atom  recoils  away 
from  the  lattice,  is  set  at  0.8,  but  this  parameter 
gives  good  agreement  with  the  data  over  a  wide 
range  of  values  between  0.1  to  0.95  for  a  Ni  mass 
of  58.7.  This  excellent  correlation  between  the 
cross  section  for  collision  induced  absorption  and 
the  energy  transferred  to  the  Ni  lattice  suggest  a 
surface  deformation  mechanism.  The  incident  atom 
collides  with  the  surface  either  directly  on  top  of  or 
close  to  the  adsorbed  H  atom.  The  impact  of  the 
collision  transfers  energy  to  the  surface  which 
momentarily  distorts  it  to  open  up  channels  or  holes 
into  which  the  hydrogen  can  readily  diffuse  and 
enter  the  bulk. 
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3.3  The  Chemistry  of  Bulk  Hydrogen 
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Using  molecular  beam  techniques  coupled  with  high 
resolution  electron  energy  loss  spectroscopy,  we 
have  recently  demonstrated  the  synthesis  and 
detection  of  hydrogen  embedded  in  the  bulk  of  Ni 
under  low  pressure,  UHV  conditions.  These  capa¬ 
bilities  have  enabled  the  chemistry  of  bulk  hydrogen 
to  be  probed  unambiguously  for  the  first  time.  We 
have  found  that  the  bulk  hydrogen  atom  has  unique 
chemistry.  In  particular,  we  have  shown  that  the 
species  reactive  with  adsorbed  CH3  is  not  a  surface 
bound  H  atom  but  a  bulk  H  atom. 

The  experiment  is  performed  as  follows.  Exposure 
of  the  Ni(111)  crystal  to  atomic  hydrogen  not  only 
results  in  hydrogen  embedded  within  the  Ni  lattice 
but  also  in  a  monolayer  of  hydrogen  adsorbed  on 
the  surface.  The  surface-bound  hydrogen  blocks 
sites  necessary  for  the  second  reactant,  adsorbed 
CH3  that  we  synthesize  by  the  dissociative  chem¬ 
isorption  of  CH4.  Therefore,  the  H  adsorbed  on  the 
surface  must  be  removed.  The  removal  of  the 
surface-bound  H  cannot  be  effected  thermally 
because  it  is  more  stable  than  bulk  H,  recombining 
and  desorbing  between  340-390  K.  Therefore, 
removal  of  the  surface-bound  H  by  the  non-thermal 
process,  collision  induced  recombinative  desorption, 
is  necessary.  A  100  kcal/mol  Xe  atom  beam  is 
directed  at  the  surface  at  a  glancing  angle.  The 
impacts  of  the  Xe  atoms  deform  the  surface  and 
allow  the  hydrogen  to  recombine  and  desorb  as  Hj. 
In  this  manner,  the  surface  is  swept  clean  of 
hydrogen  while  the  bulk  hydrogen  remains.  The 
clean  surface  is  then  exposed  to  CH4  incident  at  17 
kcal/mol.  The  CH4  dissociatively  adsorbs, 
producing  adsorbed  CH3  and  adsorbed  hydrogen. 
With  the  reactants  now  synthesized,  the  surface 
temperature  is  ramped  at  2K/s  and  the  partial  pres¬ 
sures  of  hydrogen  and  methane  are  measured.  At 
exactly  the  same  temperature  at  which  hydrogen 
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desorbs  from  the  bulk,  180  K,  there  is  very  intense 
desorption  of  CH4.  We  have  also  carried  out  this 
reaction  with  a  deuterium  isotope  embedded  in  the 
Ni  lattice.  In  this  case,  the  sole  product  observed  is 
CH3D.  There  is  no  formation  of  CH4  or  higher 
deuterated  methanes.  In  the  absence  of  bulk 
hydrogen,  no  CH4  is  observed  to  desorb. 

Methane  is  formed  solely  by  the  reaction  with  bulk 
hydrogen.  The  surface  bound  hydrogen  is 
unreactive  with  CH3.  The  reaction  likely  proceeds 
by  the  direct  recombination  of  a  bulk  H  atom  with 
CH3  because  the  interstitial  octahedral  site  in  which 
the  hydrogen  atom  is  bound  is  directly  beneath  the 
threefold  hollow  site  on  which  the  CH3  is  bound.  As 
the  surface  temperature  is  raised,  the  bulk 
hydrogen  atom  moves  up  toward  the  surface  where 
it  encounters  the  methyl  radical.  Because  the 


hydrogen  atom  now  has  the  correct  orientation 
required  by  the  transition  state  for  sp^  hybridization, 
it  reacts  with  CH3  and  immediately  desorbs  as  CH4. 
The  reaction  of  CH3  with  an  adsorbed  H  atom  pro¬ 
bably  does  not  occur  because  access  of  the  H 
atom  to  the  Ni3-C  bond  is  blocked.  This  result  doc¬ 
uments  a  new  mechanism  for  a  surface  reaction — a 
reaction  between  an  adsorbed  and  a  bulk 
species — and  it  unambiguously  demonstrates  the 
importance  of  bulk  hydrogen  as  a  reactant  in  a  het¬ 
erogeneous  catalytic  reaction. 
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4.1  Introduction 
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Understanding  the  properties  of  surfaces  of  solids 
and  the  interactions  of  atoms  and  molecules  with 
surfaces  has  been  of  extreme  importance  both  from 
.chnological  and  academic  points  of  view.  The 
recent  advent  of  ultrahigh  vacuum  technology  has 
made  possible  microscopic  studies  of  well- 
characterized  surface  systems.  The  way  atoms 
move  to  reduce  the  energy  of  the  surface,  the 
number  of  layers  of  atoms  involved  in  this  reduc¬ 
tion,  the  electronic  and  vibrational  states  that  result 
from  this  movement,  and  the  final  symmetry  of  the 
surface  layer  are  of  utmost  importance  in  arriving  at 
a  fundamental  and  microscopic  understanding  of 
the  nature  of  clean  surfaces,  chemisorption  pro¬ 
cesses,  and  the  initial  stages  of  interface  formation. 

The  theoretical  problems  associated  with  these 
systems  are  quite  complex.  However,  we  are  cur¬ 
rently  at  the  forefront  of  solving  the  properties  of 
real  surface  systems.  In  particular,  we  are  contin¬ 
uing  our  efforts  in  developing  new  techniques  for 
calculating  the  total  ground-state  energy  of  a 
surface  system  from  "first  principles,"  so  that  we 
can  provide  accurate  theoretical  predictions  of 
surface  geometries  and  behavior.  Our  efforts  in 
this  program  have  concentrated  in  the  areas  of 
surface  growth,  surface  reconstruction  geometries, 
structural  phase  transitions,  and  chemisorption. 


4.2  Surface  Reconstruction 

The  (7x7)  reconstruction  of  Si{111)  is  perhaps  the 
most  complex  and  widely  studied  surface  of  a  solid. 
Since  its  discovery  through  low-energy  electron 
diffraction  (LEED)  more  than  thirty  years  ago,  an 
enormous  amount  of  effort  has  been  expended  to 
elucidate  the  properties  of  this  important  surface. 
Based  on  this  work,  it  is  now  generally  accepted 
that  the  geometry  of  the  (7x7)  reconstruction  is 
described  by  the  dimer-adatom-stacking-fault  (DAS) 
model  as  proposed  by  Takayanagi  et  al.  The  com¬ 
plexity  of  this  geometry,  however,  has  defied  any 
complete  and  realistic  theoretical  treatment  of  its 
properties.  The  only  progress  that  could  be  made 
theoretically  was  by  isolating  and  modeling  bits  and 
pieces  of  the  surface.  The  only  attempt  at  a  com¬ 
plete  work  used  an  empirical  tight-binding  model  to 
study  the  Si(111)-(7x7)  reconstruction  in  a  supercell 
geometry  with  196  atoms. 

In  this  work,  we  have  exploited  the  power  of  the 
state  of  the  art  in  parallel  computation  to  demon¬ 
strate  the  feasibility  of  performing  ab  initio  calcu¬ 
lations  with  supercells  approching  1000  atoms. 
Specifically,  we  have  performed  the  first  ab  initio 
calculation  of  the  Si(111)-(7x7)  using  a  supercell 
geometry  with  700  effective  atoms.  These  calcu¬ 
lations  predict  the  fully  relaxed  atomic  geometry  of 
this  system,  allow  construction  of  theoretical  scan¬ 
ning  tunneling  microscope  (STM)  images  as  a  func¬ 
tion  of  bias  voltages,  and  predict  the  energy 
difference  between  the  (7x7)  and  (2x1)  recon¬ 
structions. 

The  DAS  model  for  the  Si(111)-(7x7)  surface  recon¬ 
struction  is  shown  in  figure  1.  The  unit-cell 
boundary  is  outlined  by  a  dashed  line.  The  largest 
solid  circles  denote  the  twelve  atoms.  The  smaller 
solid  circles  denote  six  rest  atoms  that  lie  one  layer 
below  the  surface  and  are  threefold  coordinated. 
There  is  also  a  single  corner  hole  in  each  unit  cell. 
Together,  these  19  atoms  account  for  the  19 
unbonded  electrons  on  the  reconstructed  surface. 


Chapter  4.  Semiconductor  Surface  Studies 


Figure  1.  (a)  Top  view  of  the  DAS  model  for  the  Si(1 1 1  )-(7x7)  surface  reconstruction.  The  (7x7)  unit  cell  is  outlined. 
Atoms  at  increasing  distances  from  the  surface  are  indicated  by  circles  of  decreasing  size.  The  large  solid  circles 
denote  the  twelve  adatoms,  the  smaller  solid  circles  represent  the  rest  atoms.  The  faulted  half  of  the  unit  cell  is  on  the 
left.  Small  open  circles  denote  the  dimers,  while  small  solid  circles  and  dots  represent  atoms  in  the  unreconstructed 
layers,  (b)  Side  view.  Atoms  on  the  lattice  plane  along  the  long  diagonal  of  the  surface  unit  cell  are  shown  with  larger 
circles  than  those  behind  them.  The  numbers  label  the  37  unique  atoms  in  the  four-layer  slab.  All  others  are  related  by 
symmetry  operations. 


The  smallest  solid  circles  denote  the  fourth  layer  of 
atoms  below  the  surface  on  the  unfaulted  half  of 
the  unit  cell.  Nine  dimers  are  present  two  layers 
below  the  surface.  One  dimer  is  labeled  by  atoms 
with  the  number  16  and  17.  The  side  view  shows 
that  the  stacking  sequence  in  the  right  half  of  the 
unit  is  the  same  as  in  bulk  Si,  while  the  stacking 
sequence  in  the  left  half  is  faulted. 

Our  model  system  consists  of  a  superceil  with  a 
slab  geometry  containing  vacuum  on  txith  sides. 
Periodic  boundary  conditions  are  applied  in  all 
directions.  The  slab  contains  the  adatom  layers 
and  four  surface  layers  shown  in  figure  1(b)  plus  a 
mirror  image  reflection  in  the  vertical  direction. 


Thus,  the  supercell  consists  of  eight  layers  of  atoms 
with  adatom  layers  on  both  slab  surfaces  and  a 
region  of  10  A  of  vacuum.  The  400  silicon  atoms 
and  the  vacuum  layer  make  this  supercell  equiv¬ 
alent  to  a  700-atom  system.  Unit-cell  dimensions 
are  22  A  X  2  A  X  26  A. 

All  atoms  were  allowed  to  move  freely  except  the 
innermost  layer,  which  was  frozen  in  its  bulk  posi¬ 
tion.  Thus,  our  model  of  the  surface  includes  the 
relaxation  of  three  surface  layers  in  addition  to  the 
adatom  layer.  The  atoms  were  assumed  to  be  in 
their  fully  relaxed  positions  when  the  forces  on  the 
ions  were  converged  to  0.15  eV/A. 
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Table  1  shows  the  relaxed  ionic  positions.  The 
computed  positions  in  the  surface  plane  are  nearly 
identical  with  those  from  the  tight-binding  calcu¬ 
lations.  The  most  significant  differences  involve 
expansion  of  the  surface  layers  away  from  the  bulk 
layer.  Compared  to  the  tight-binding  calculations, 
the  adatoms  move  -0.03  A  away  from  the  bulk 
layer.  The  rest  atoms  move  -0.02  A  toward  the 
bulk  layer.  Most  of  the  remaining  atoms  move 
-0.05  A  away  from  the  bulk  layer.  However, 
atoms  directly  below  adatoms  move  only  -  0.03  A, 


Atom 

;ir 

Y 

2 

Adatoms 

1 

1.500 

1.500 

1.662 

2 

4.492 

4.491 

1.594 

3 

6.002 

6.002 

1.555 

4 

9.005 

9.005 

1.603 

First-layer 

5 

1.033 

1.033 

-0.057 

atoms 

6 

1.958 

0.993 

-0.058 

7 

2.481 

2.481 

0.348 

8 

4.021 

4.022 

-0.155 

9 

4.974 

4.050 

-0.099 

10 

5.980 

5.057 

-0.135 

II 

6.478 

6.478 

-0  163 

12 

8.019 

8.019 

0.345 

13 

9.029 

8.059 

-0.104 

14 

9.472 

9.472 

-0.1  II 

Second-layer 

15 

1.504 

1.504 

-1.512 

atoms 

16 

1.162 

0.006 

-1.057 

17 

1.800 

0.007 

-1.042 

18 

2.492 

1.499 

-0.867 

19 

2.969 

2.969 

-0.863 

20 

4.494 

4.495 

-1.561 

21 

5.403 

4.780 

-1.046 

22 

6.004 

6.004 

-1.567 

23 

7.532 

7.532 

-0.871 

24 

8.505 

7.510 

-0.874 

25 

8.998 

8.998 

-1.537 

Third-layer 

26 

0.001 

0.001 

-3.986 

atoms 

27 

1.500 

1.500 

-4.416 

28 

1.006 

-0.002 

-3.989 

29 

1.990 

0.000 

-3.964 

30 

2.499 

1,499 

-3.946 

31 

2.997 

2.997 

-3.951 

32 

4.499 

4.499 

-4.460 

33 

5.496 

4.512 

-3.973 

34 

5,999 

5.999 

-4.440 

35 

7,497 

7.497 

-3.933 

36 

8.505 

7.496 

-3.930 

37 

9.001 

9.001 

-4.414 

Table  1.  Relaxed  atomic  positions  for  the  adatom  layer 
and  the  first  three  surface  layers  for  the  ab  initio  (7x7) 
calculation.  The  coordinate  system  is  the  same  as  in  ref¬ 
erence  10.  All  reduced  coordinates  (X,  Y,  Z)  are  with 
respect  to  the  Cartesian  system  indicated  in  figure  1, 
where  the  x  axis  is  along  the  cubic  [110]  direction,  and 
the  y  axis  is  along  the  [111]  direction  outward  normal  to 
the  surface.  The  actual  atomic  coordinates  (x,  y,  z)  are 
related  tOL(X,  Y,  ZjJxy  the  scaling  relations  x  =  aX, 
y  =  aY/^  ,  z  =  aZ/y  24  ,  where  a-3.85  A  is  the  (1x1) 
surface  hexagonal  lattice  constant. 


resulting  in  a  more  distorted  adatom  configuration. 
The  atoms  two  layers  directly  below  move  the  most 
compared  to  the  tight-binding  calculations,  relaxing 
-0.10  A  away  from  the  bulk  layer.  The  relative 
heights  of  adatoms  1 ,  2,  3,  and  4  are  0.085,  0.031 , 
0.000,  and  0.038  A,  respectively.  Experimental 
LEED  values  are  0.120,  0.080,  0.0,  and  0.040. 

Having  generated  the  electronic  wave  functions  in 
our  pseudopotential  calculation,  we  preceded  to 
make  theoretical  STM  figures.  In  figure  2,  we 
compare  the  theoretical  results  (top)  with  corre¬ 
sponding  experiments  at  a  tip  voltage  of  2  V 
(bottom)  for  both  occupied  and  unoccupied  states. 

The  theoretical  STM  probe  was  placed  3.8  A  above 
the  surface  adatom  layer.  The  probe  height  is 
limited  by  the  presence  of  a  second  surface  layer 
above  the  probe  due  to  the  periodic  replication  of 
the  unit  cell.  The  theoretical  grey  scale  was 
matched  to  the  experimental  grey  scale  at  the  tops 
of  the  adatoms  and  bottoms  of  the  corner  holes, 
and  linearly  interpolated  in  between.  Experiments 
show  a  variation  of  2-3  A  corresponding  to  this 
scale.  The  corresponding  theoretical  variation  is  5 
A.  Depths  greater  than  3  A  are  set  to  minimum 
intensity  in  order  to  match  the  experimental  grey 
scale. 

The  most  fundamentally  intereresting  part  of  this 
calculation  is  the  comparison  of  formation  energies 
of  the  (7x7)  reconstruction  and  the  (2x1)  surface. 
Cleavage  of  Si  to  create  the  (111)  surface  results  in 
a  (2x1)  metastable  structure.  This  surface  must 
then  be  annealed  to  generate  the  stable  (7x7) 
structure.  We  find  that  the  (7x7)  reconstruction  is 
energetically  favorable  over  the  metastable  (2x1) 
surface  by  60  meV  per  (141)  cell.  Our  absolute 
values  are  1.179  eV  for  the  (7x7)  surface  and  1.239 
eV  for  the  (2x1). 

The  calculations  discussed  in  this  section  were 
made  possible  because  of  the  considerable  compu¬ 
tational  power  associated  with  massively  parallel 
architectures.  The  calculations  elevate  ab  initio 
investigations  to  a  new  echelon  and  demonstrate 
that  studies  of  complex  material  systems  with 
supercells  containing  nearly  an  order  of  magnitude 
more  atoms  than  previously  possible  are  now  fea¬ 
sible. 


4.3  Computation  at  Large  Length 
Scales 

As  we  discussed  in  the  previous  section,  with 
recent  advances  in  highly  parallel  computer  archi¬ 
tecture,  calculations  on  systems  with  hundreds  of 
atoms  are  now  tractable.  This  not  only  permits 
more  comfortable  extraction  of  the  thermodynamic 
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Figure  2.  STM  images  of  the  Si(1 1 1)-(7x7)  surface  reconstruction  based  on  the  results  of  this  calculation  (top  panels) 
and  the  experimental  results  of  Avouris  and  Wolkow  (bottom  panels).  Shown  are  unoccupied  (left  panel)  and  occupied 
(right  panel)  state  contributions.  The  determination  of  the  grey  scale  in  the  top  panels  is  described  m  the  text. 


limit  but  also  extends  the  qualitative  level  of  com-  for  integration  of  the  equations  of  motion  to 

plexity  in  accessible  problems.  However,  with  approach  zero  as  the  system  size  increases.  While 

growing  interest  in  calculations  involving  larger  the  onset  of  this  instability  depends  on  the  physics 

systems,  it  has  been  demonstrated  that  as  the  of  the  system  under  study,  the  effect  has  been 

longest  physical  length  sale  in  a  calculation  demonstrated  dramatically  in  a  silicon  system  of  six 

increases,  integration  of  the  fictitious  electronic  primitive  cells  ( ~  46  A  long)  at  8  Ry,  To  fully 

equations  of  motion  becomes  dominated  by  fluctu-  benefit  from  the  next  generation  of  computer  tech- 

ations  in  the  electronic  Hartree  energy  ("charge  oologies,  an  approach  free  of  charge  sloshing  is 

sloshing").  These  fluctuations  force  the  time  step  needed. 
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Recently,  conjugate-gradient  procedures  introduced 
by  Teter,  Payne,  Allan,  and  Gillan  for  relaxing  the 
electronic  wave  functions  at  fixed  atomic  arrange¬ 
ments  have  been  shown  not  to  exhibit  the  afore¬ 
mentioned  instability  and  to  consume  an  order  of 
magnitude  less  CPU  time  than  the  best  fictitious 
electronic  dymamics  schemes.  Despite  this 
improvement,  performing  a  dynamics  calculation 
with  conjugate-gradient  techniques  is  very  costly 
because  of  the  tight  tolerances  to  which  the  elec¬ 
tronic  problem  must  be  solved  (typcially  10 
pV/atom)  to  yield  stable  atomic  trajectories. 

Very  recently  we  have  overcome  these  difficulties 
and  introduced  a  new  technique  for  performing 
finite-temperature  ab  initio  total-energy  calculations 
at  long  length  scales.  The  method  involves  the 
introduction  of  a  new  class  of  energy  functionals 
and  an  efficient  scheme  for  performing  interative 
solutions. 

To  demonstrate  the  practicality  of  the  new  tech¬ 
nique  for  long-length-scale  systems,  we  have  per¬ 
formed  tests  using  a  system  that  is  well  over  50  A 
long.  In  particular,  we  have  determined  the  phonon 
spectrum  of  Si  through  the  velocity-velocity  autocor¬ 
relation  function  of  the  ions  in  our  calculation.  After 
projecting  the  ionic  displacements  from  our  calcula¬ 
tion  onto  the  allowed  k  states  and  the  known  purely 
longitudinal  and  transverse  polarization  vectors,  we 
then  determine  the  frequencies  present  in  the  time 
of  autocorrelation  functions  of  these  spatial  Fourier 
coefficients  with  the  maximum  entropy  method. 
Taking  the  frequencies  of  the  peak  values  of  the 
resulting  spectra,  we  compare  our  parameter-free 
results  with  the  experimentally  measured  phonon 
frequencies  in  figure  3  and  find  excellent  agree¬ 
ment.  Our  spectra  clearly  and  accurately  reproduce 
not  only  the  celebrated  flattening  of  the  lower 
acoustic  (TA)  modes  as  one  moves  away  from  the 
r  point  but  also  the  delicate  splitting  of  the  optic 
modes  along  A.  Note  that  the  calculation  also 
accurately  reproduces  the  lowest  frequency 
acoustic  mode  despite  its  period  of  nearly  1  ps  and 
manifests  the  delicate  splitting  of  the  optic  modes, 
which  beat  against  each  other  with  even  longer 
periods.  This  illustrates  that  our  dynamics  tech¬ 
nique  produces  reliable  ionic  trajectories  even  over 
periods  of  picoseconds  and  is  thus  very  precise. 
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5.1  Structure  and  Phase  Behavior  of 
the  Si(113)  Surface 
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Over  the  last  year,  we  have  pursued  a  comprehen¬ 
sive  x-ray  scattering  study  of  the  structure  and 
phase  behavior  of  the  Si(113)  surface  in  collab¬ 
oration  with  Professor  R.J.  Birgeneau.  The  results 
will  allow  us  to  determine  the  atomic  structure  of 
the  (3x1)  reconstructed  Si(113)  surface.  More¬ 
over,  it  is  already  clear  that  our  results  represent  an 
important  advance  for  our  basic  understanding  of 
order-disorder  transformations  in  model  two- 
dimensional  systems. 

In  theoretical  models  of  an  overlayer  on  a  sub¬ 
strate,  it  is  possible  to  identify  four  generic  phases. 

(1)  Commensurate  (C)  solids  are  characterized  by 
long-range  translational  order  and  a  wavevector 
which  is  commensurate  with  the  substrate  lattice. 

(2)  C-fluids  are  also  characterized  by  a 
commensurate  wavevector,  but  their  translational 
correlation  length  is  finite.  (3)  Incommensurate  (1C) 
floating  solids  exhibit  a  wavevector  which  is  incom¬ 
mensurate  with  the  substrate  and  an  infinite  corre¬ 
lation  length.  (4)  Finally,  IC-fluids  show  an 
incommensurate  wavevector  and  a  finite  correlation 
length. 


1  M.  Schick.  Surf.  Sci.  1 1 ;  245  (1981). 

2  J.M,  Kosterlitz  and  D.J.  Thouless,  J.  Phys.  C  6:  1 181  (1973). 

3  V.L.  Pokrovski  and  A.L.  Talapov,  Phys.  Rev.  Lett.  42;  65  (1979). 
<  J.  Villain,  Surf.  Sci.  97:  219  (1980). 

5  V.J.  Emery  and  I.  Peschel.  Z  Phys.  B  A3:  241  (1981). 


The  critical  behavior  at  the  phase  transformations 
between  these  different  structures  is  largely  known; 
C-solid-to-fluid  transformations  are  described  by 
p-state  Potts  models,  where  p  is  the  order  of  the 
commensurability  (which  is  defined  as  the  period  of 
the  overlayer  measured  in  units  of  the  substrate 
lattice  constant).'  IC-solid-to-fluid  melting  is 
Kosterlitz-Thouless  melting.*  The  transformation 
between  a  C-solid  and  an  IC-solid  is  a  Pokrovski- 
Talapov  transformation  for  uniaxial  phases*  and  dis¬ 
continuous  in  the  hexagonal  case.'*  C-fluids  and 
IC-fluids  are  separated  by  so-called  disorder  lines.® 

Each  of  the  above  mentioned  transformations  is 
well  understood  theoretically.  The  situation  is  more 
complicated  for  the  transformation  from  a  C-solid 
into  an  IC-fluid,  where  the  order  of  the  commensur¬ 
ability  (p)  is  key  in  determining  the  expected 
behavior.  For  p  s:4,  there  is  general  agreement 
that  an  IC-solid  phase  always  separates  the  C-solid 
and  IC-fluid  phases.  For  p=2,  there  is  agreement 
that  near  the  critical  temperature  for  the 
C-solid-to-IC-fluid  transformation  the  correlation 
length,  incommensurability,  susceptibility,  and 
order  parameter  show  power-law  behavior.  This 
behavior  i^ characterized  by  the  same  critical  expo¬ 
nents  (v,  p,  y,  p,  respectively)  as  found  for  the 
C-solid-to-fluid  transformation,  i.e.,  the  universality 
class  of  the  transformation  is  that  of  the  2-state 
Potts  model.  In  contrast,  for  p=3  the  character  of 
the  transformation  from  a  C-solid  into  an  IC-fluid 
phase  remains  unresolved.  There  are  three  pos¬ 
sible  scenarios.  The  first  is  that  a  direct 
C-solid-to-IC-fluid  transformation  cannot  occur; 
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instead,  an  IC-solid  phase  inevitably  intervenes.® 
The  second  possibility  is  that  there  is  a  direct  trans¬ 
formation,  which  falls  in  the  same  universality  class 
as  the  C-solid-to-fluid  transformation  (3-state  Potts 
universality  class). ^  Finally,  Huse  and  Fisher®  have 
suggested  that  there  may  be  a  direct  transforma¬ 
tion,  but  that  it  is  in  a  new  chiral  melting  universality 
class,  with  as-yet-unknown  critical  exponents. 

The  signature  of  a  chiral  transformation  is  that  the 
product  of  the  incommensurability  and  the  corre¬ 
lation  length  approaches  a  constant  near  the  critical 
temperature  (Tc),  i.e.,  3-v.®  In  contrast,  according 
to  den  Nijs,  the  product  of  the  incommensurability 
and  the  correlation  length  goes  to  zero  near  Tc. 
Specifically,  he  predicts  p  =  2v.  It  is  clear  that 
careful  studies  of  experimental  realizations  of  a 
system  such  as  the  Si(113)  surface,  which  deter¬ 
mine  critical  behavior,  are  particularly  valuable. 
Below  Tc  =  959K,  Si(113)  exhibits  a  (3x1) 

commensurate  reconstruction  (p=3)  and  a  transfor¬ 
mation  into  a  disordered  IC  structure  for  temper¬ 
atures  above  Tc.®  For  Si(113),  the  reconstructed 
surface  layer  plays  the  role  of  the  overlayer,  while 
subsequent  layers  with  bulk  periodicity  constitute 
the  substrate. 

Figures  1  and  2  show  x-ray  diffraction  profiles 
obtained  at  the  National  Synchrotron  Light  Source 
near  the  surface  (5/3,  1)  position  in  reciprocal 
space  for  several  different  temperatures.  This 
experiment  employed  our  new  JSEP-funded  UHV 
surface  x-ray  scattering  apparatus.  Below  Tc  the 
scattering  profile  consists  of  a  narrow  profile  located 
at  the  (5/3,  1)-position  (o=t=0).  For  temperatures 
increasing  above  Tc,  the  peak  of  the  scattering 
shifts  away  from  a=0  (y-direction)  and  broadens  in 
both  o  and  in  t  (x-direction).  The  peak  shift  reflects 
the  formation  of  discommensurations  in  the 
erstwhile  commensurate  reconstruction,  so  that  the 
surface  layer  is  now  incommensurate.  The  peak 
broadening  indicates  that  the  translational  corre¬ 
lation  length  in  the  incommensurate  phase  is  finite, 
i.e.,  it  is  an  IC-fluid.  Figure  3  shows  results  for  the 
dependence  of  the  incommensurability  (e),  the 
inverse  correlation  lengths  (k^  and  k,),  the  square  of 
the  order  parameter  (lo),  and  the  wavevector 
dependent  susceptibility  (x)  versus  reduced  temper¬ 
ature  ((T  -  Tc)/Tc). 


Figure  1.  Scans  along  the  x-direction  through  the  (5/3, 
1)  position  in  the  reciprocal  space  of  the  reconstructed 
Si(113)  surface  versu',  temperature.  The  solid  lines  are 
least-mean-squares  fits  of  a  Lorentzian  scattering  func¬ 
tion  plus  a  smoothly-varying  background  to  the  data, 
which  determine  the  peak  shift  from  the  commensurate 
petition,  i.e.,  incommensurability  (e),  peak  width  (k.)  and 
ptik  intensity  (x). 

Each  of  these  quantities  shows  power-law  behavior 
over  approximately  two  decades  of  reduced  temper¬ 
ature  with  exponents  p  -  0.63  ±  0.05  for  the  incom¬ 
mensurability,  V,  -  0.65  ±  0.07  foi  the  inverse 
correlation  length  in  the  incommensurate  direction. 


6  F.D.M.  Haldane,  P.  Bak,  and  T.  Bohr,  Phys.  Rev.  S28:  2743  (1983). 

r  S.F.  Howes,  Phys.  Rev.  B  Z7\  1762  (1983);  M  den  Nijs,  Phase  Transitions  and  Critical  Phenomena,  eds.  C.  Domb  and  J.  Lebowitz 
(London:  Academic  Press,  1987). 

8  D  A.  Huse.  and  M.E.  Fisher,  Phys.  Rev.  Lett.  49;  793  (1982). 

9  Y.N.  Yang,  E  D.  Williams,  R.L.  Park,  N.C.  Bartelt,  and  T.L.  Einstein,  Phys.  Rev.  Lett.  64:  24' 0  (1990);  B.Z.  Olshanetsky  and  V.l, 
Mashanov,  Surf.  Sci.  Ill:  414  (1981). 
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1)  position  in  the  reciprocal  space  of  the  reconstructed 
Si(113)  surface  versus  temperature.  The  solid  lines  are 
least-mean-squares  fits  of  a  Lorentzian  scattering  func¬ 
tion  plus  a  smoothly  varying  background  to  the  data, 
which  determine  peak  width  (k,),  and  peak  intensity. 

Vy=  1.05  ±0.07  for  the  inverse  correlation  length  in 
the  transverse  direction,  2p- 0.23  ±0.03  for  the 
square  of  the  order  parameter,  and  y- 1-56  ±0.10 
for  the  susceptibility. 

These  results  are  striking  for  several  reasons. 
First,  we  believe  that  the  data  presented  in  figures 
1,  2,  and  3  represent  the  highest  quality  measure¬ 
ments  to  date  of  surface  critical  exponents.  The 
results  are  particularly  impressive  when  one  con¬ 
siders  that  the  scattering  emanates  from  relatively 
high-index  Si  surface  and  that  Si  is  not  a  strong 
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Ffgure  3.  Dependence  of  the  incommensurability  (e), 
inverse  correlation  lengths  (  k.  and  Xy),  susceptibility  (x), 
and  square  of  the  order  parameter  (lo)  versus  reduced 
temperature  ((T  -  Tc)/Tc).  Note  the  log-log  scales. 


x-ray  scatterer.  They  demonstrate  that  our  appa¬ 
ratus  teamed  with  an  intense  synchrotron  x-ray 
source  constitutes  the  world’s  premier  facility  for 
surface  x-ray  diffraction  studies.  Only  with  such 
high  quality  data  is  it  possible  to  accurately  deter¬ 
mine  critical  exponents.  Second,  it  is  clear  that  the 
exponent  for  the  correlation  length  along  the 
x-direction  is  different  from  that  for  the  correlation 
length  along  the  y-direction,  i.e.,  there  is  anisotropic 
scaling.  Such  behavior  is  unusual,  although  one 
instance  in  which  it  occurs  is  the  uniaxial 
C-solid-to-IC-solid_  transformation,  where  exact 
results  give  v, -p-1/2  and  Vy-1.'°  Furthermore, 
Howes”  has  suggested  that  Vy-1  and  v, -2/3  at 
the  special  point  in  the  phase  diagram  (Lifshitz 
point)  where  the  line  of  direct  C-solid-to-IC-fluid 
transformations  ends  and  is  replaced  by  successive 
C-solid-to-IC-solid  and  IC-solid-to-IC-fluid  transfor¬ 
mations.  Third,  the  measured  exponent  values  are 


'0  V.L.  Pokrovski,  and  A.L.  Talapov,  Phys.  Rev.  Lett.  42:  65  (1979). 
1'  S.F.  Howes.  Phys.  Rev.  8  27:  1762  (1983). 
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consistent  with  the  hyperscaling  relation; 
V,  +  Vy  =  7  +  2p.  A  final  important  observation  is  that 
the  product  of  the  incommensurability  and  the  cor¬ 
relation  lengjh  along  the  incommensurate  direction 
is  constant  (p  -  v,)  as  expected  for  chiral  melting. 

A  direct  C-solid-to-IC-fluid  transformation  indicates 
that  the  phase  diagram  proposed  by  Haldane  et  al. 
is  incorrect.  Furthermore,  the  observation  of  aniso¬ 
tropic  scaling  informs  us  that  the  disordering  trans¬ 
formation  of  Si(113)  is  not  in  the  universality  class 
of  the  3-state  Potts  model.  It  follows  that  these 
data  imply  that  the  C  solid-to-IC-fluid  transformation 
of  Si(113)  lies  in  a  chiral  melting  universality  class 
as  proposed  by  Huse  and  Fisher.  Moreover,  the 
previously  unknown  exponents  which  characterize 
the  chiral  melting  universality  class  have  been 
determined. 

We  are  confident  that  these  results  will  stimulate 
and  guide  further  theoretical  work  aimed  at  finally 


understanding  the  critical  behavior  at  the 
C-solid-to-IC-fluid  transformation  with  p=3.  This 
would  complete  our  understanding  of  phase  trans¬ 
formations  in  this  type  of  model  two-dimensional 
system. 


5.2  Publications 
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Abernathy,  D.L.,  S.G.J.  Mochrie,  D.M.  Zehner,  G. 
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1.1  Optical  Data  Storage  with  Raman 
Excited  Spin  Echoes 
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There  has  been  much  recent  interest  in  the  use  of 
optical  photon  echoes  for  information  storage  and 
processing.'*  So  far,  however,  the  most  successful 
demonstrations  have  been  restricted  to  doped  crys¬ 
tals  operating  at  liquid  helium  temperatures,  and 
homogeneous  decay  of  the  excited  state  limits  per¬ 
formance.  Here  we  demonstrate,  theoretically  as 
well  as  experimentally,  that  optical  data  can  be 
stored  and  retrieved  using  spin  echoes  which  are 
excited  and  detected  by  the  optical  resonance 
Raman  interaction.  The  advantage  is  that  spin 
echoes  contain  no  contribution  from  the  optical 
excited  state  and  therefore  are  not  limited  by 
excited  state  decay.  Moreover,  since  Raman  spin 
echoes  are  optically  excited  and  detected,  full 
optical  holographic  spatial  resolution  is  available. 
Finally,  it  might  be  possible  to  develop  high  temper¬ 


ature  optical  memories  and  processors,  since  spin 
echo  data  storage  has  been  demonstrated  at  room 
temperature.® 

The  ability  of  Raman  excited  spin  coherence  to 
store  optical  temporal  and  phase  information  is 
based  on  the  fact  that  this  coherence  is  sensitive  to 
the  relative  phases  of  the  optical  fields.®  When  inho¬ 
mogeneous  broadening  is  present,  the  differential 
optical  phase  sensitivity  permits  the  storage  of 
optical  temporal  variations,  just  as  in  conventional 
optical  (or  microwave)  echo  storage  schemes. 

Experimental  realization  of  the  three-part  storage 
and  recall  scheme  is  accomplished  with  a  sodium 
atomic  beam  that  has  three  spatially  separated 
interaction  zones  and  uses  an  off-resonant  optical- 
Raman  beam  as  the  rephasing  field.  The  exper¬ 
imental  setup  is  illustrated  in  figure  1.  Here,  the 
Raman  transition  is  the 

3^Si/2(F=l,m=1)«' 


3^Pi/2{F=2,m=2)-M.32Si/2{F=2,m=1) 


'  Rome  Laboratory,  Hanscom,  Massachusetts. 

2  Harvard  University,  Cambridge,  Massachusetts. 

3  Tufts  University,  Medford,  Massachusetts. 

*  M.K.  Kim  and  R.  Kachru,  "Storage  and  Phase  Conjugation  of  Multiple  Images  Using  Backward-stimulated  Echoes  in  Pr’-r^  :LaF„’  Opt. 
Lett.  12:  593  (1987). 

5  S.  Fernbach  and  W.G.  Proctor,  "Spin-echo  Memory  Device,'  J.  Appt.  Phys.  26: 170  (1955). 

6  M.S.  Shahriar  and  P.R.  Hemmer,  "Direct  Excitation  of  Microwave-spin  Dressed  States  Using  a  Laser-excited  Resonance  Raman 
Interaction,"  Phys.  Rev.  Lett.  65:  1865  (1990). 
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Figure  1.  Experimental  setup  for  optical  data  storage 
and  recall  using  optical-Raman  excited  spin  echoes,  in  a 
sodium  atomic  beam. 


transition  at  589.7  nm  optical  frequency,  where  the 
ground  state  splitting  is  1.772  GHz  and  the  excited 
state  decay  time  is  16  nsec.  Ground  state  inhomo¬ 
geneous  broadening  is  produced  using  the  linear 
Zeeman  shift  and  an  applied  magnetic  field  gradient 
of  1  Gauss/cm,  to  give  a  total  ground  state  inhomo¬ 
geneous  broadening  of  450  kHz. 

The  optical  pulse  sequence  is  shown  in  figure  2a. 
The  first  two  Raman  pulses  interfere  to  store  the 
optical  temporal  information  via  spectral  hole¬ 
burning.  The  rephasing  is  accomplished  with  an 
off-resonance  Raman  n-pulse.  The  Raman  probe 
beam  senses  the  echo  via  changes  in  absorption. 
Figure  2b  shows  the  experimentally  observed  echo 
signals.  The  echo  amplitude  is  about  50%  of  the 
maximum  demodulated  signal  obtained  when  a 
single,  saturating  Raman  data  pulse  is  applied  in 
the  absence  of  a  magnetic  field  gradient  (no 
dephasing).  Figure  2c  shows  the  theoretical  echo 
signal  calculated  using  a  numerical  solution  of  the 
optical  Bloch  equations  for  input  parameters  that 
approximately  correspond  to  experimental  condi¬ 
tions  of  figure  2c.  Good  qualitative  agreement 
between  theory  and  experiment  is  achieved,  both  in 
the  width  and  peak  amplitude  of  the  echo  signals. 

Although  the  present  optical  data  storage  exper¬ 
iment  employs  optical  data  pulses  that  are  longer 
than  the  excited  state  homogeneous  decay  time, 
this  is  not  a  fundamental  requirement.  In  fact, 
Raman  resonant  optical  pulses  shorter  than  the 
optical  homogeneous  decay  time  can  also  be 
stored.  The  new  fundamental  limit  on  optical 
storage  capacity,  using  Raman-optical  echoes,  is 
given  by  the  ratio  of  the  spin  homogeneous  lifetime 
to  the  optical  inhomogeneous  lifetime.  This  is  typi- 


Figure  2.  (a)  Pulse  sequence  for  data  storing  and 

retrieval,  (b)  Experimentally  observed  echo  signals,  (c) 
Corresponding  theoretical  echoes. 

cally  several  orders  of  magnitude  larger  than  for 
optical  two-level  echoes.  Furthermore,  if  the 
rephasing  microwave  ji-pulse  is  split  into  two 
temporally  separated  ji/2-pulses,  then  archival 
storage  is  possible. 

In  summary,  we  have  experimentally  demonstrated 
that  the  Raman  transparent  states  can  store  and 
retrieve  optical  temporal  and  phase  information  via 
a  ground  state  spin  echo  process.  This  opens  up 
the  possibility  of  enhancing  the  performance  of 
optical  photon  echo  memories.  Future  work 
includes  demonstration  of  this  type  of  data  storage 
in  cryogenic  doped  crystals  as  well  as  a  search  for 
higher  temperature  materials. 


1.2  Phase-dependent  Velocity  Selective 
Coherent  Population  Trapping  in  a 
Folded  Three-level  A  System  under 
Standing  Wave  Excitation 

Sponsors 
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U.S.  Navy  -  Office  of  Naval  Research 
Grant  NO0 1 4-9 1-J- 1808 

Recently,  there  has  been  a  great  deal  of  interest  in 
the  forces  experienced  by  a  folded  three-level  A 
atom.  Aspect  et  al.^  first  demonstrated  that  such  an 
atom  can  be  cooled  below  the  recoil  limit  via 
velocity  selective  coherent  population  trapping 
(VSCPT)  when  excited  by  a  pair  of  counter- 
propagating  traveling  waves.  Here,  we  show  the¬ 
oretically  that  VSCPT  occurs  in  a  A  atom  excited  by 
a  pair  of  Raman  resonant  standing  waves,  and  its 
efficiency  depends  on  the  relative  phase, 
between  the  standing  waves.  Finally,  we  describe 
briefly  the  generalization  of  this  process  to  three 
dimensions. 

The  importance  of  VSCPT  in  Raman  resonant 
standing  waves  stems  from  our  earlier  experimental 
observation  of  deflection  and  cooling  of  A  sodium 
atoms  in  an  atomic  beam.®  This  experiment  sug¬ 
gests  that  these  standing  wave  forces  could  be 
used  to  design  a  stimulated  force  trap.  In  addition, 
our  theory  predicts  that  the  cooling  can  be  made  to 
have  characteristics  very  similar  to  those  of  conven¬ 
tional  polarization  gradient  cooling  (see  next 
section),  so  that  such  a  trap  should  have  a  sub- 
Doppler  temperature.  The  theory  presented  here 
suggests  that  it  may  be  possible  to  reach  a  sub¬ 
recoil  temperature  in  a  Raman  force  trap. 

In  analogy  to  the  traveling  wave  VSCPT,  we  are 
interested  in  finding  a  state  (the  dark  state)  which 
satisfies  the  following  conditions:  (1)  it  does  not 
contain  any  excited  states  so  that  it  is  completely 
decoupled  from  the  vacuum  fields,  and  (2)  the  net 
amplitude  for  coupling  this  state  to  any  of  the 
excited  states  must  vanish.  Let  us  denote  the 
linear  momentum  of  the  atom  by  p  and  the 
wavenumber  of  the  optical  field  by  k.  It  can  be 
shown  that  if  p  =  0  and/or  cj)  -  0  then  the  state: 

lNC(p)>=^ 

[  I  a  ,  p  -  Ak  >  exp  ( -  i(t»)  + 1  a  ,  p  +hk> 

exp  (i(j))  -  1  b  ,  p  -  fik>  -  I  b,  p+hk>] 

does  not  couple  to  any  excited  state,  as  illustrated 
in  figure  3.  Here,  the  square  boxes  represent  the 


boxes  represent  the  relative  weights  of  the  matrix 
element.  Here,  we  have  chosen  5  -  -  Q,. 

relative  transition  matrix  elements.  Summing  over 
the  allowed  transitions  we  find,  for  an  atom  starting 
in  I NC  >  at  t  =  0,  ‘hat  the  amplitude  of  being  in  the 
excited  state  after  a  time  dt  is  given  by 
dAe  ■  -  2i(dt)®gk(p/m)  sin  (j>,  where  g  is  the  Rabi  fre¬ 
quency,  and  m  is  the  mass  of  the  atom.  Thus,  for 
p  =  0  and/or  <j)=  0,  |  NC  >  is  a  dark  state.  For 

4» 0,  we  therefore  have  a  single,  zero  velocity  dark 
state,  and  VSCPT  occurs.  The  efficiency  of  VSCPT 
would  vary  as  sin^cj),  being  maximum  at 
<t)=  (2n+1)n/2  and  vanishing  at  <|)  =  nn.  We  also  find 
that  for  (|)  -  (2n+1)  n/2,  the  standing  wave  VSCPT  is 
nearly  as  efficient  as  the  traveling  wave. 

The  existence  of  VSCPT  corresponds  to  the 
absence  of  diffusion  for  zero  velocity  atoms  in  the 
trapped  state.  However,  Chang  et  al.®  have  previ¬ 
ously  computed  a  non-zero  diffusion  coefficient  in 
Raman  resonant  standing  waves  by  a  perturbative 
numerical  solution  of  the  Wigner  density  matrix 
equation  of  motion.  This  disagreement  with  Chang 
et  al.  seems  to  stem  from  the  fact  that  they 
assumed,  a  priori,  that  the  distribution  of  atoms  in 
momentum  space  is  smooth,  which  is  in  sharp  con¬ 
trast  with  the  result  of  VSCPT.  We  should  also 
point  out  that  VSCPT  observed  by  Aspect  et  al.  for 
a  variable  angle  between  the  planes  of  polarization 
of  two  counterpropagating  traveling  waves  can  be 
interpreted  in  terms  of  the  theory  of  standing  wave 
VSCPT  as  developed  here,  thus  validating  our 


^  A.  Aspect,  E.  Arimondo,  R.  Kaiser,  N.  Vansteenkisto,  and  C.  Cohen-Tannoudji,  Phys.  Rev.  Lett.  61:  826  (1988). 

8  P.  Hommer,  M,  Shahriar,  M.  Prentiss,  D.  Katz,  K.  Berggren,  J.  Mervis  and  N.  Bigelow,  Phys.  Rev.  Lett.  68:  3148  (1992). 

9  S.  Chang,  B.  Garraway,  and  V.  Minogin,  Opt.  Comm.  77: 19  (1990). 
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result.  Finally,  we  have  found  that  this  process  can 
be  generalized  to  three  dimensions  employing  a 
j  =  1  « j'  =  1  transition,  with  three  mutually 
orthogonal  pairs  of  Raman  resonant  standing 
waves. 

In  summary,  we  have  shown  theoretically  that 
phase  sensitive  VSCPT  takes  place  when  a  A 
system  is  excited  by  a  pair  of  Raman  resonant 
standing  waves.  This  corresponds  to  absence  of 
diffusion  for  zero  velocity  atoms  in  the  trapped 
state,  in  disagreement  with  predictions  made  by 
Chang  et  al.  We  also  find  that  previous  exper¬ 
imental  observations  reported  by  Aspect  et  al. 
agree  with  our  predictions.  Finally,  we  outline  the 
generalization  to  three  dimensions.  Given  our  prior 
observation  of  stimulated  trapping  and  cooling 
forces  and  our  theoretical  prediction  of  strong  pol- 
grad  cooling  in  Raman  resonant  standing  waves, 
the  simultaneous  existence  of  VSCPT  may  make  it 
possible  to  design  a  subrecoil  temperature  trap  for 
A  atoms. 


1 .3  Continuous  Pol-Grad  Pre-Cooling 
for  Loading  a  Subrecoil  Temperature 
Trap 

Sponsors 

U.S.  Air  Force  -  Electronic  Systems  Division 
Contract  F19628-92-K-0013 
U.S.  Navy  -  Office  of  Naval  Research 
Grant  N001 4-91 -J-1 808 

As  discussed  in  the  preceding  section,  there  has 
been  increasing  interest  in  multilevel  atoms  that 
undergo  velocity  selective  coherent  population  trap¬ 
ping  (VSCPT)  into  a  zero  velocity  dark  state. 
However,  since  VSCPT  takes  place  via  a  random 
walk  in  momentum  space,  in  three  dimensions  the 
efficiency  would  fall  off  rapidly  as  a  function  of 
velocity,  with  a  capture  range  of  the  order  of  the 
recoil  velocity.  Therefore,  in  order  to  significantly 
populate  the  3D  dark  state,  it  is  necessary  to  pre¬ 
cool  atoms  to  the  recoil  limit.  It  is  advantageous  for 
this  pre-cooling  to  occur  simultaneously  with 
VSCPT:  othenwise  the  random  walk  causes  most  of 
the  atoms  to  heat  up  to  velocities  beyond  the 
capture  range.  This  type  of  pre-cooling  does  not 
exist  in  the  various  VSCPT  schemes  considered  so 
far. 

Here,  we  present  a  new  mechanism  under  which 
polarization-gradient  cooling  efficiently  slows  atoms 
from  a  sample  at  the  Doppler  limit  to  the  recoil  limit. 
Unlike  other  methods  of  cooling  to  the  recoil  limit, 
this  cooling  occurs  under  the  same  conditions  as 
required  for  VSCPT.  We  illustrate  the  basic  mech¬ 


anism  in  one  dimension  using  a  Sisyphus  model. 
The  predictions  of  this  model  are  consistent  with 
numerical  results  obtained  from  continued  fractions 
as  well  as  from  numerical  integrations  of  the  optical 
Bloch  equations.  We  also  discuss  the  generaliza¬ 
tion  of  this  scheme  to  three  dimensions.  This  new 
mechanism  opens  the  possibility  of  continuously 
cooling  a  large  number  of  atoms  to  a  subrecoil  tem¬ 
perature. 

The  basic  features  of  this  cooling  technique  are  well 
demonstrated  in  one  dimension  by  a  A  system 
excited  by  a  pair  of  Raman  resonant  standing  wave 
fields.  Efficient  cooling  is  obtained  in  the  case 
where  the  common  mode  detuning  b  is  positive  and 
the  phase  difference  (j)  between  the  standing  waves 
is  ji/4.  We  have  developed  a  Sisyphus  type  model 
to  interpret  this  process  physically.  This  model  is 
illustrated  in  figure  4,  using  the  non-absorbing 
( I  -  > )  and  absorbing  ( I  +  >  )  superposition  states. 
Figure  4a  shows  the  Rabi  frequencies,  figure  4b 
shows  the  population  of  I  +  >  ,  and  figure  4c  shows 
the  Stark  shifts  of  I  +  >  and  I  -  >  .  As  can  be  seen, 
the  atoms  on  average  climb  energy  hills,  thus  expe¬ 
riencing  cooling. 

Figure  5a  (thick  line)  shows  a  plot  of  the  averaged 
force  in  natural  units  as  a  function  of  velocity 
obtained  from  a  continued  fraction  solution.  The 
parameters  used  here  are  <^  =  7i/4,  go  = 
0.3  and  6  =1.0.  We  fit  this  plot  to  a  function  of  the 
form  f(v)  =  -^*v/(1  +  v^/v§),  which  is  maximum  at 
v-Vc.  We  find  Vc*  5.5  X  10-3  and  t;o  »  0.81. 
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The  Sisyphus  type  model  predicts  Vc=  3.4x10-3 
and  to  «  0.97,  with  reasonable  agreement.  For 
equivalent  values  of  parameters,  these  numbers  are 
comparable  (^=3.0,  Vc=  4.0x10-3)  to  the  ones 
estimated  by  Dalibard  et  al.'°  for  a  J  =  1/2  **J'  = 
3/2  transition.  The  dashed  line  superimposed  on 
figure  5a  shows  the  corresponding  values  of  the 
Doppler  cooling  force,  calculated  for  the  I  a  >  *♦ 
|e>  transition,  in  the  absence  of  I  b  >,  for  the 
same  parameters.  As  can  be  seen,  the  pol-grad 
cooling  coefficient  (slope)  is  about  a  factor  of  30 
larger  than  the  Doppler  cooling  coefficient.  Figure 
5b  shows  how  the  force  varies  as  a  function  of  (>, 
for  V  =  5.5  X  10-3. 


Figure  5.  (a)  The  thick  line  shows  the  averaged  pol-grad 
cooling  force  for  sodium  as  a  function  of  velocity,  in  units 
of  Fs=  10-3x  fikr/2.  The  dashed  line  shows  the  corre¬ 
sponding  Doppler  cooling  force,  and  the  dotted  area  rep¬ 
resents  the  capture  range  for  VSCPT.  (b)  The  solid  line 
shows  the  pol-grad  cooling  force  as  a  function  of  <(i.  The 
dotted  line  shows  the  efficiency  of  VSCPT. 


As  discussed  above,  this  system  undergoes  VSCPT 
into  a  zero  velocity  dark  state,  so  that  the  equilib¬ 
rium  temperature  is  limited  only  by  the  interaction 
time  and  can  be  substantially  below  the  recoil  limit. 
The  efficiency  of  VSCPT  into  this  state  varies  as 
sin^*)),  as  illustrated  by  the  dotted  line  in  figure  5b. 
We  have  also  found  how  to  generalize  this  scheme 
to  three  dimensions,  following  the  approach  of 
Ol’shanii  et  al." 

We  have  estimated  the  enhancement  of  the  effi¬ 
ciency  of  VSCPT  in  three  dimensions  due  to  the 
pol-grad  pre-cooling.  Consider,  for  example,  a 
sample  of  sodium  atoms  cooled  to  the  Doppler  limit. 
In  the  presence  of  the  pol-grad  cooling  mechanism, 
the  atoms  would  be  pre-cooled  to  a  velocity  spread 
of  the  order  of  v„  so  that  the  time  needed  for  all  the 
atoms  to  end  up  in  the  dark  state  would  be  much 
less  than  that  needed  in  the  absence  of  such  pre¬ 
cooling.  Preliminary  calculations  suggest  nearly 
three  orders  of  magnitude  enhancement  in  effi¬ 
ciency  from  pol-grad  pre-cooling. 

In  summary,  we  show  that  pol-grad  cooling  occurs 
in  the  A  system  simultaneously  with  VSCPT.  This 
process  may  continuously  and  efficiently  transfer 
atoms  from  a  Doppler  temperature  sample  to  near¬ 
recoil  velocities  that  are  within  the  capture  range  of 
VSCPT,  which  would  then  continuously  cool  the 
atoms  to  below  the  recoil  limit.  This  pol-grad  pre¬ 
cooling  is  estimated  to  enhance  the  VSCPT 
pumping  rate  by  nearly  three  orders  of  magnitude 
compared  to  the  rate  achievable  from  Doppler  pre¬ 
cooling  alone.  We  present  results  from  a  Sisyphus 
model,  along  with  numerical  results  obtained  from 
continued  fractions.  Experimental  efforts  are  in  pro¬ 
gress  for  realizing  this  scheme  in  three  dimensions. 


1.4  Raman  Gain  in  a  A  Three-level 
System  with  Closely  Spaced  Ground 
States 

Sponsor 

U.S.  Air  Force  -  Electronic  Systems  Division 
Contract  F19628-92-K-0013 

Recently,  there  has  been  much  interest  in  the 
Raman  gain  observed  in  a  A  system  with  closely 
spaced  ground  states  such  that  both  ground  states 
are  initially  equally  populated  and  are  both  coupled 


’0  J.  Dalibard  and  C.  Cohen-Tannoudji,  J.  Opt.  Soc.  Am.  B  6;  2023  (1989). 

1'  M.  Ol'shanii  and  V.  Minogin,  In  Proceedings  of  the  International  Workshop  on  Light  Induced  Kinetic  Effects  on  Atoms,  Ions,  and 
Molecules,  ed.  L.  Moi  et  al,  (Pisa,  Italy:  ETS  Editrice,  1991),  p.  99. 
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to  the  excited  state  by  a  single  frequency  pump.’* 
This  system  is  of  interest  for  appliCc:*ions  such  as 
frequency  shifting, self  phase  conjugation, and 
high  precision  laser  magnetometry. 

We  have  studied  the  mechanism  of  this  gain  and 
found  qualitative  agreement  with  experimental 
results.’®  Briefly,  a  single  frequency  pump  interacts 
with  both  legs  of  the  system,  as  shown  in  figure  6a. 
Figure  6b  shows  a  typical  gain  lineshape.  The  ver¬ 
tical  rectangle  superimposed  on  this  plot  indicates  a 
resonance  frequency  of  the  dressed  states  of  the 


atom  and  the  pump.  As  can  be  seen,  gain  is 
peaked  at  this  resonance  frequency.  Thus,  we  are 
able  to  interpret  the  gain  spectrum  in  terms  of  the 
energy  intervals  of  the  dressed  states  of  the 
pumped  system. 

The  gain  mechanism  is  analogous  to  the  process  of 
sideband  amplification  in  a  strongly  driven  two-level 
system.  However,  the  width  of  gain  is  not  limited 
by  the  linewidth  of  the  excited  state.  This  is  illus¬ 
trated  in  figure  6c,  where  the  linewidth  is  plotted  as 
a  function  of  pump  Rabi  frequency.  As  can  be 


♦ 


(0  (d) 


Figure  6.  (a)  Doubly  pumped  Raman  gain  scheme,  (b)  A  typical  gain  lineshape,  centered  at  a  resonance  frequency  of 
the  dressed  states,  (c)  Linewidth  of  gain  as  a  function  of  pump  Rabi  frequency,  (d)  Gain  as  a  function  of  pump  detuning. 
The  gain  is  expressed  in  arbitrary  units. 


'2  J.  Donoghue,  M.  Cronin-Golomb,  J.S.  Kane,  and  P.R.  Hemmer,  Opt.  Lett.  16:  1313  (1991);  M.  Poelker  and  P.  Kumar,  Opt.  Lett. 
17(6):  399  (1992);  M.S.  Shahriar,  P.R.  Hemmer,  J.  Donoghue,  M.  Cronin-Golomb,  and  P.  Kumar,  OSA  Annual  Meeting  Tech.  Dig. 
23:  127  (1992). 

'3  M.  Poelker  and  P,  Kumar,  Opt.  Lett.  17(6):  399  (1992). 

J,  Donoghue,  M.  Cronin-Golomb,  J.S.  Kane,  and  P.R.  Hemmer.  Opt.  Lett.  16: 1313  (1991). 
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seen,  linewidth  falls  off  linearly  with  the  Rabi  fre¬ 
quency.  In  practice,  linewidth  is  limited  only  by 
additional  factors  such  as  transit  time.  This  is  con¬ 
sistent  with  extremely  narrow  gains  (less  than 
1  kHz)  observed  in  sodium  vapor.  In  addition,  we 
have  found  that  gain  amplitude  becomes  a  constant 
below  a  certain  intensity.  Thus,  this  gain  mech¬ 
anism  is  in  principle  thresholdless  (if  not  limited  by 
transit  time).  This  is  because  the  dressed  states 
population  inversion  needed  for  this  gain  is  created 
via  optical  pumping.  Thus,  for  a  long  enough 
transit  time,  optical  pumping  saturates  even  for  a 
very  low  pump  intensity. 

In  summary,  we  have  studied  the  Raman  gain 
mechanism  for  a  doubly  pumped  A  system.  For  a 
low  intensity,  the  gain  amplitude  and  width  are 
limited  by  transit  time  only.  These  observations  are 
in  qualitative  agreement  with  previous  experimental 
observations.  One  of  the  various  applications  of 
this  process  is  high  resolution  laser  magnetometry. 
We  have  performed  preliminary  experiments  for  this 
application,  with  encouraging  results. 

1.5  Brillouin  Laser  Fiberoptic 
Gyroscope 

Sponsor 

Charles  S.  Draper  Laboratory 
Contract  DL-H-418522 

Research  is  in  progress  on  a  new  fiberoptic  ring 
laser  gyroscope  based  on  two  counter-propagating 
stimulated  Brillouin  scattering  (SBS)  lasers  which 
are  generated  in  the  same  fiberoptic  ring 
resonator.'®  The  use  of  SBS  is  crucial  to  the  opera¬ 
tion  of  this  gyro  because  a  conventional  solid-state 
gain  medium  cannot  support  simultaneous 
bidirectional  lasing  due  to  gain  competition.  It  is  the 
directional  property  of  the  SBS  gain  medium  that 
prevents  gain  competition  and  thus  allows  stable, 
simultaneous,  bidirectional  lasing. 

In  the  presence  of  an  inertial  rotation  normal  to  the 
plane  of  the  resonator,  a  difference  frequency  is 
automatically  generated  between  the  counterpropa- 
gating  SBS  lasers  which  is  directly  proportional  to 
the  rotation  rate,'^  as  predicted  by  the  Sagnac 
effect.  The  operation  of  this  gyro  is  very  similar  to 
that  of  the  bulkoptic  ring  laser  gyroscope  (RLG) 
based  on  the  He-Ne  gain  medium.  It  should  be 


noted  that  gain  competition  in  a  gaseous  medium  is 
avoided  by  using  two  partially  overlapping,  Doppler 
broadened  gain  media. 

In  contrast  with  the  passive  interferometer  or  reson¬ 
ator  gyroscope,  the  SBS  fiber  RLG,  as  in  the 
bulkoptic  RLG,  does  not  require  external  means  to 
measure  the  nonreciprocal  phase  shift  that  is 
induced  by  rotation. 

Figure  7  shows  a  simplified  schematic  of  a  SBS 
ring  laser  gyroscope.  Light  from  a  1  mW  He-Ne 
laser  at  1.15  |xm  is  split  into  two  pump  beams,  PI 
and  P2,  shifted  by  acousto-optic  modulators,  and 
coupled  into  counterpropagating  directions  of  the 
same  ring  resonator.  For  maximum  effective  pump 
power  inside  the  resonator,  the  pump  lasers  are 
held  at  the  center  of  a  cavity  resonance,  using  a 
servo  not  shown  in  the  figure,  and  are  matched  to 
the  polarization  of  the  cavity  resonance. 

With  the  pump  lasers  PI  and  P2  above  the  60  pW 
Brillouin  threshold,  two  SBS  lasers,  B1  and  B2,  are 
generated  simultaneously  in  directions  opposite  to 
those  of  their  respective  pumps.  The  two  SBS 
lasers  are  then  combined  via  a  coupler  and  fall  onto 
detector  D. 

Figure  8  shows  the  difference  frequency  between 
B1  and  B2  when  a  sinusoidal  rotation  is  applied  to 
the  gyroscope.  As  predicted  by  the  Sagnac  rffect, 
the  frequency  difference  varies  h,. early  with  the 
applied  rotation  rate,  which  is  90  degrees  out  of 
phase  with  the  rotation  angle  in  figure  8b. 

However,  it  is  important  to  note  that,  in  figure  8,  for 
a  range  of  low  rotation  rates  the  frequency  differ- 


Figure  7.  Simplified  schematic  diagram  for  a  SBS  ring 
laser  gyroscope. 


'6  S.P.  Smith,  F.  Zarinetchi,  and  S  Ezekiel,  "Fiberoptic  Ring  Laser  Gyroscope,"  Proceedings  of  OPS  ‘89,  Paris,  France,  1989,  post 
deadline  paper. 

’7  F.  Zarinetchi,  S.P.  Smith,  and  S,  Ezekiel,  "Stimulated  Brillouin  Fiber  Optic  Laser  Gyroscope,"  Opt.  Lett.  16:  229  (1991). 
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Figure  8.  Simultaneous  recording  of  (a)  difference  fre¬ 
quency  between  SBS  lasers  as  a  function  of  rotation  and 
(b)  angle  of  the  applied  rotation. 


ence  between  the  SBS  lasers  is  zero.  This 
"lock-in"’  zone  is  similar  to  that  observed  in  the 
bulkoptic  RLG  and  is  caused  by  the  coupling  of  the 
counterpropagating  lasers  through  backscattering 
within  the  cavity.  In  the  bulkoptic  RLG,  the  lock-in 
effect  has  been  thoroughly  studied  and  is  removed 
by  mechanical  dither.'® 

Mechanical  rotation  dither  minimizes  the  amount  of 
time  th^t  the  gyro  spends  in  the  lock-in  zone,  thus 
reducing  the  errors  due  to  lock-in.  Mechanically 
dithering  the  fiber  RLG  would  be  much  simpler  tnan 
dithering  the  bulkoptic  RLG  because  of  its  very 
small  mass. 

Another  promising  method  for  the  removal  of 
lock-in,  which  is  being  investigated  here  and  else¬ 
where,'®  is  optical  dither.  In  this  case,  instead  of 
mechanically  rotating  the  cavity,  the  optical  path 
length  of  the  cavity  is  modulated  to  generate  a  non¬ 
reciprocal  phase  shift  between  the  SBS  lasers.  The 
optical  path  length  of  the  cavity  can  be  modulated 
using  two  phase  shifters,  PS1  and  PS2,  symmet¬ 
rically  located  on  the  cavity,  as  shown  in  figure  9. 
With  the  phase  shifters  driven  180  degrees  out  of 
phase,  there  is  no  net  change  in  the  optical  length 
of  the  cavity.  However,  due  to  the  finite  time  it  takes 
the  light  to  go  from  one  modulator  to  the  other,  a 
small  nonreciprocal  phase  shift  is  generated.  By 
appropriately  adjusting  the  phase  modulations,  the 


Figure  9.  Simplified  schematic  diagram  of  a  fiber  reson¬ 
ator  with  optical  dither. 


coupling  between  the  lasers  can  be  greatly  sup- 
:.'essed  or  completely  eliminated. 

Figures  10  and  11  demonstrate  the  effect  of 
adjusting  optical  dither  in  a  fiber  cavity  with  high 
backscattering.  Figure  10a  sliows  the  difference  in 
frequency  between  the  SBS  lasers  for  a  peak  rota¬ 
tion  equivalent  to  a  Sagnac  frequency  of  approxi¬ 
mately  9  kHz  and  a  lock-in  zone  of  about  1  kHz, 
with  Figure  10b  indicating  the  corresponding  rota¬ 
tion  angle.  Figure  11  shows  the  response  to  the 
same  rotation  with  reduced  optical  dither,  thus 
increasing  the  size  of  the  lock-in  zone  to  approxi¬ 
mately  2  kHz.  In  practice,  the  maximum  achievable 
lock-in  suppression  is  limited  by  several  factors 
including  errors  in  modulator  location,  misadjust- 
ment  of  modulation  parameters,  and  modulation 
distortion. 

Another  method  for  removing  the  lock-in  is  by  gen¬ 
erating  the  SBS  lasers  with  a  large  frequency  differ¬ 
ence  between  them  to  prevent  the  lasers  from 
locking.  It  is  possible  to  generate  a  separation  in 
frequency  between  the  SBS  lasers  by  either  fre¬ 
quency  shifting  one  of  the  pump  lasers  or  by  using 
separate  pumps.  In  this  way,  the  counter- 
propagating  SBS  lasers  can  be  generated  in  dif¬ 
ferent  longitudinal  modes  of  the  cavity.  Figure  12 


’8  F.  Aronowitz,  "The  Laser  Gyro,"  in  Laser  Applications,  ed.  M.  Ross  (New  York:  Academic  Press.  1971),  vol.  1,  pp,  133-200. 

'9  S.  Huang,  K.  Toyama,  P.A.  Nicati,  L.  Thevenaz,  B.Y.  Kim.  and  H.J.  Shaw,  "Brillouin  Fiber  Optic  Gyro  with  Push-pull  Phase  Modulator 
and  Synthetic  Heterodyne  Detection,"  Proc.  SPIE  1795  (1992),  Fiber  Optic  and  Laser  Sensors  X,  Boston,  Massachusetts,  September 
8-11,  1992. 
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Figure  10.  (a)  Recording  of  SBS  laser  difference  fre¬ 
quency  and  (b)  the  corresponding  rotation  angle  with 
applied  optical  dither. 

shows  the  lock-in  free  output  beat  frequency  with 
rotation  for  such  a  configuration. 

Aside  from  the  temperature  sensitivity  of  this  mode 
of  operation,  there  are  a  number  of  additional  error 
sources  that  are  unique  to  such  an  SBS  gyro.  One 
such  error  is  due  to  bias  variations  caused  by  SBS 
dispersion  pulls.  In  this  case,  the  backscattering  of 
pump  laser  P1,  for  example,  generates  a  very  small 
SBS  gain  in  the  same  direction  as  P1.  This  new 
SBS  gain,  however,  is  also  in  the  same  direction  as 
SBS  laser  B2  and  will  pull  the  frequency  of  B2, 
causing  a  bias  variation.  Further,  since  the  back- 
scattering  in  the  cavity  is  highly  variable,  the  size  of 
the  new  SBS  gain  and  hence  the  size  of  this  fre¬ 
quency  pull  will  also  be  highly  variable. 

In  addition  to  lock-in,  there  are  other  error  sources 
that  are  unique  to  any  type  of  fiber  resonator  gyro. 
These  include  fiber  birefringence  and  the  nonlinear 
optical  Kerr  effect,^®  both  of  which  have  been 
observed  and  must  be  appropriately  controlled. 


—^2.75  sec  l*“ 


Figure  11.  (a)  SBS  laser  difference  frequency  and  (b) 
the  corresponding  rotation  angle  with  reduced  optical 
dither. 


Figure  12.  (a)  SBS  laser  difference  frequency  with 

applied  rotation  and  (b)  the  corresponding  rotation  angle 
for  SBS  gyro  using  SBS  lasers  with  widely  separated  fre¬ 
quencies. 


20  S.  Ezekiel.  J.L.  Davis,  and  R.W.  Hellwarth,  "Observation  of  Intensity-Induced  Nonreciprocity  in  a  Fiber  Optic  Gyroscope,"  Opt.  Lett. 
7:  457  (1982) 
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2.1  Millimeter-Wave  Frequency 
Measurement  of  the  Rydberg  Constant 

Sponsors 

Joint  Services  Electronics  Program 
Contract  DAAL03-92-C-0001 
National  Science  Foundation 
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Pin  P.  Chang,  Dr.  Theodore  W.  Ducas,  Jeffrey  R. 
Holley,  Robert  P.  Lutwak,  Scott  N.  Paine,  Professor 
Daniel  Kleppner 

The  Rydberg  constant,  R„  relates  the  wavelengths 
of  the  spectrum  of  atomic  hydrogen  to  practical  lab¬ 
oratory  units.  As  such,  R.  is  the  natural  unit  for 
measurements  of  many  atomic  properties.  Recent 
advances  in  optical  wavelength  metrology  have 
enabled  experiments  using  laser  spectroscopy  to 
measure  R,  with  accuracy  approaching  2  parts  in 
10".“ 


While  R.  remains  the  most  accurately  measured 
fundamental  constant,  other  high-precision  exper¬ 
iments,  which  depend  on  R.  as  an  auxiliary  con¬ 
stant,  demand  even  more  accurate  measurement. 
The  accuracy  of  optical  wavelength  measurements 
is  approaching  the  practical  limitations  of  wave¬ 
length  metrology.  Further  progress  in  the  measure¬ 
ment  of  R.  will  require  frequency  measurements 
making  use  of  the  modern  definition  of  length  in 
terms  of  time  intervals  and  the  defined  speed  of 
light. 

We  are  attempting  to  advance  the  measurement  of 
R.  by  directly  measuring  cR.,  the  "Rydberg  fre¬ 
quency."  By  initially  preparing  highly  excited 
"Rydberg"  states  of  atomic  hydrogen  around  n=30, 
we  are  able  to  measure  millimeter-wave  transitions 
to  nearby  states  with  the  full  precision  of  frequency 
metrology. 

The  goal  of  our  experiment  is  three-fold:  First  is 
the  reevaluation  of  R*  itself,  providing  an  inde¬ 
pendent  check,  in  a  different  regime,  on  concurrent 
developments  in  optical  wavelength  metrology. 
Second  is  the  measurement  of  the  ground  state 
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Lamb  shift.  Because  our  measurements  involve 
high  angular  momentum  states  for  which  the  Lamb 
shift  is  extremely  small,  our  results  may  be  com¬ 
pared  with  optical  measurements  of  transitions 
between  low-lying  states  to  yield  an  improved 
measurement  of  the  Lamb  shift.  Third  is  the  fre¬ 
quency  calibration  of  the  spectrum  of  hydrogen, 
which  may  eventually  lead  to  a  single  frequency 
standard  extending  from  the  radio-frequency  regime 
to  the  ultraviolet. 

Our  experiment  is  performed  with  an  atomic  beam 
in  order  to  suppress  the  Doppler  effect  and  to  avoid 
collisional  perturbations.  Atomic  hydrogen  is 
excited  to  a  low  angular  momentum  n=29,  m=0 
state  by  two-photon  stepwise  absorption.  The 
hydrogen  atoms  are  then  transferred  to  the  longer 
lived  n=29,  m=28  "circular"  state  by  the  method  of 
crossed  electric  and  magnetic  fields.®  The  atoms 
enter  a  region  of  uniform  fields  in  which  the  fre¬ 
quency  of  the  resonant  transition  n=29,  m=28  -► 
n=30,  m=29  is  measured.  The  atoms  interact  with 
the  millimeter-wave  radiation  at  two  locations  in  a 
Ramsey  separated  oscillatory  fields  geometry.  The 
final  state  distribution  of  the  atoms  is  measured  by 


a  state-selective  electric  field  ionization  detector. 
The  resonance  signal  is  observed  as  a  transfer  of 
atoms  from  the  n=29  state  to  the  n=30  state  as  the 
millimeter-wave  frequency  is  tuned  across  the  tran¬ 
sition. 

Figure  1  illustrates  the  main  features  of  the  appa¬ 
ratus.  Atomic  hydrogen  or  deuterium  is  produced 
by  dissociation  of  H2  or  Dz  in  a  radio  frequency  dis¬ 
charge.  The  beam  is  cooled  by  collisions  with  the 
walls  of  a  cryogenically  cooled  thermalizing  channel 
in  order  to  slow  the  beam  and  thereby  increase  the 
interaction  time.  The  atoms  are  excited  to  the 
n=29,  m=28  state  by  two-photon  stepwise  excitation 
in  the  circular  state  production  region.  The 
development  of  the  hydrogen  beam  and  optical 
systems  was  described  in  a  previous  Progress 
Report.  The  magnetic  field  necessary  to  transfer 
the  atoms  to  the  circular  state  is  provided  by  per¬ 
manent  magnets.  The  electric  field  is  produced  by 
an  arrangement  of  strip  electrodes  which  allows  the 
direction  of  the  field  to  be  rotated.  A  detector  in  the 
circular  state  production  region  monitors  the  effi¬ 
ciency  of  the  laser  excitation  and  momentum 
transfer  processes. 


5  D.  Delande  and  J.C.  Gay,  Europhys.  Lett.  5;  303  (1988). 
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After  the  atoms  are  prepared  in  the  n=29  circular 
state,  the  beam  enters  the  interaction  region. 
Because  Rydberg  atoms  interact  strongly  with 
external  fields,  accurate  measurement  of  the  energy 
level  structure  requires  careful  control  of  the  inter¬ 
action  environment.  Thermal  radiation  is  reduced 
by  cooling  the  interaction  region  to  - 1 0  K  by  a 
liquid  helium  flow  system.  The  ambient  magnetic 
field  is  shielded  out  by  a  double-wall  high- 
permeability  shield.  A  small  electric  field,  which 
defines  the  quantization  axis  of  the  atoms,  is 
applied  with  high  uniformity  by  field  plates  with  cor¬ 
rective  strip  electrodes  along  the  sides.  The  milli¬ 
meter  waves  intersect  the  atomic  beam  at  two 
locations  separated  by  50  cm.  The  millimeter-wave 
optics  were  described  in  a  previous  Progress 
Report. 

The  state  distribution  of  the  atoms  emerging  from 
the  interaction  region  is  analyzed  by  an  electric  field 
ionization  detector.  The  atoms  enter  a  region  of 
increasing  electric  field  produced  by  a  ramped  plate 
held  at  constant  potential.  The  atoms  in  the  n=30 
state,  which  ionize  at  lower  field,  are  collected  in 
the  first  detector,  while  the  n=29  atoms,  which 
ionize  at  higher  field,  are  collected  at  the  second 
detector. 

In  the  past  year  we  have  begun  making  measure¬ 
ments.  Some  typical  time-resolved  Ramsey  fringe 
data  is  presented  in  figure  2.  The  accuracy  with 
which  we  can  currently  extract  R.  from  the  data  is 
limited  by  our  understanding  of  the  systematics  of 
the  experiment  to  about  1  part  in  10®.  In  order  to 
extract  all  the  necessary  systematic  information,  it 
will  be  necessary  to  increase  the  signal-to-noise 
ratio  of  the  experiment  by  a  factor  of  at  least  1 000. 
Towards  this  end,  efforts  are  underway  to  increase 
the  duty  cycle  of  the  experiment  and  to  decrease 
signal  loss  due  to  blackbody  radiation.  We  have 
acquired  a  120  Hz  excimer  pump  laser  to  replace 
our  aging  10  Hz  YAG  laser  and  have  begun 
development  of  a  new  ultraviolet  dye  laser  system. 
In  addition  to  the  improvement  in  signal  from  the 
increased  repetition  rate,  we  expect  the  new  system 
to  provide  a  narrower  UV  linewidth  and,  hence, 
improved  efficiency  of  atomic  excitation.  We  are 
also  investigating  alternative  methods  of  circular 
state  production  which  may  lead  to  better  radiation 
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Figure  2.  Preliminary  data  displaying  narrow  linewidth 
"Ramsey"  fringes.  The  deuterium  n=29,  m=28  -*  n=30, 
m=29  circular  transition  is  shown.  The  time  resolution  of 
the  detection  system  allows  each  velocity  group  to  be 
studied  independently.  The  velocity-averaged  data  is 
presented,  along  with  the  theoretical  model,  at  the  bottom 
of  the  page.  Further  refinement  of  these  experiments  will 
yield  a  measurement  of  R.  with  accuracy  of  1  part  in 
10". 

shielding  and  simpler  evaluation  of  the  magnetic 
field  strength  in  the  interaction  region. 


2.1.1  Publications 

Chang,  P.  Measurement  of  the  Rydberg  Frequency 
Using  Circular  Rydberg  States  of  Atomic 
Hydrogen.  Ph.D.  diss..  Dept,  of  Physics,  MIT, 
1992. 

Paine,  S.  High  Precision  Millimeter-Wave  Spectros¬ 
copy  of  Atomic  Hydrogen.  Ph.D.  diss..  Dept,  of 
Physics,  MIT,  1992. 
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2.2  The  Diamagnetic  Rydberg  Atom 
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U.S.  Navy  -  Office  of  Naval  Research 
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Michael  W.  Courtney,  Hong  Jiao,  Professor  Daniel 
Kleppner 

A  highly  excited  hydrogen  atom  in  a  strong  mag¬ 
netic  field,  the  so-called  "diamagnetic  hydrogen 
atom,"  is  among  the  simplest  nonseparable 
systems  in  quantum  mechanics.  Understanding 
this  atom  could  provide  a  key  to  the  more  general 
aspects  of  nonseparable  systems.  The  problem  is 
also  attracting  attention  in  the  context  of  nonlinear 
dynamics  because  its  classical  behavior  displays  a 
transition  from  orderly  to  disorderly  motion  as  the 
energy  is  increased  in  a  fixed  magnetic  field.  One 
can  study  the  quantum  structure  of  the  system  in 
this  regime  both  theoretically  and  experimentally. 
Thus,  the  diamagnetic  hydrogen  atom  provides  an 
ideal  testing  ground  for  studying  the  relation 
between  quantum  structure  and  disorderly  classical 
motion,  a  subject  sometimes  called  "quantum 
chaos." 

We  have  developed  techniques  for  carrying  out 
high  resolution  laser  spectroscopy  on  the  lithium 
atom  in  a  strong  magnetic  field.  The  difference 
between  lithium  and  hydrogen  is  minor  for  Rydberg 
atoms. 


There  are  no  general  solutions  to  this  problem  and 
perturbation  theory  is  inapplicable  in  the  positive 
energy  regime.  In  previous  years,  our  experimental 
results®  have  helped  to  stimulate  theoretical 
advances,  but  an  obsolete  superconducing  magnet 
made  further  progress  difficult.  The  apparatus  has 
been  rebuilt  using  a  new  superconducting  magnet. 

The  new  magnet  employs  a  split  coil  configuration. 
The  four-inch  bore  allows  a  significantly  larger  inter¬ 
action  region  and  more  convenient  optical  access 
than  previously  possible.  Rydberg  atoms  are  very 
sensitive  to  stray  electric  fields,  which  was  a  limiting 
factor  in  our  previous  work.  Figure  4  shows  the 
new  interaction  region  in  which  stray  electric  fields 
are  greatly  reduced  because  surfaces  are  farther 
from  the  interaction  volume.  In  addition,  any  stray 
electric  field  parallel  to  the  magnet  axis  can  be  can¬ 
celled  out  by  field  plates,  or  an  electric  field  can  be 
applied  as  an  addditional  probe  of  the  system. 
Another  improvement  in  the  interaction  region  is  the 
collection  of  light  from  flourescence  decay  of  the  3S 
state.  One  mirror  focuses  light  onto  a  light  pipe. 
Another  doubles  the  area  of  collection  by  focusing 
light  back  to  the  point  of  interaction. 

A  scintillator  and  light  pipe  have  replaced  the 
surface  barrier  diode  for  detection  of  Rydberg 
atoms.  After  atoms  are  ionized,  they  are  acceler¬ 
ated  into  the  scintillator  by  a  15  KV  potential.  Light 
produced  is  carried  to  a  photomultiplier  by  the  light 
pipe.  This  detection  method  has  the  advantage  of 
being  able  to  count  single  ions.  As  a  result,  our 
signal  is  linear  to  the  number  of  Rydberg  atoms 
produced,  and  we  detect  nearly  all  the  Rydberg 
atoms  produced. 


The  experiment  uses  a  lithium  atomic  beam  which 
is  excited  by  two  c.w,  lasers.  The  first  laser  excites 
the  atoms  from  the  2S  state  to  the  3S  state  by  a 
two-photon  transition  and  the  second  laser  excites 
the  atoms  to  Rydberg  states.  The  excited  atoms 
are  detected  by  electric  field  ionization.  We  typi¬ 
cally  operate  in  magnetic  fields  near  6T.  We  can 
determine  the  energy  within  10-^  cm-',  the  mag¬ 
netic  field  within  5  gauss.  Figure  3  shows  the 
atomic  beam  apparatus  and  superconducting 
magnet. 

The  Hamiltonian  for  the  diamagnetic  hydrogen 
atom,  in  atomic  units,  is 

+  (1) 


In  certain  regimes  the  system  displays  one¬ 
dimensional  behavior.  It  is  useful  to  write  the  Ham¬ 
iltonian  for  the  diamagnetic  hydrogen  atom  as 


JTp  +  +  Jf' 


where 


^l^b^bV, 


(2) 


(3) 


Pz 


2 


(4) 


6  D,  Delande,  A.  Bommier,  and  J.C.  Gay,  Pdys.  Rev.  Lett.  66;  141  (1991);  C.  lu,  G.R.  Welch,  M.M.  Kash,  D.  Kleppner,  D.  Delande, 
and  J.C.  Gay,  Ptjys.  Rev.  Lett.  66:  145  (1991). 
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J)',,  is  the  Hamiltonian  of  an  electron  in  a  magnetic 
field,  J'4  is  the  Hamiltonian  of  one-dimensional 
Hydrogen,  and  J'r' '  is  a  perturbation  to  the  other¬ 
wise  trivial  Hamiltonian.  Although  the  perturbing 
potential  is  not  small  (in  fact  it  is  singular  for  the 
entire  z=0  plane),  it  has  been  shown^  that  parts  of 
the  experimental  spectrum  can  be  understood 
simply  in  terms  of  the  unperturbed  Hamiltonian  as  a 
superposition  of  the  spectra  of  one-dimensional 
Hydrogen  and  an  electron  in  a  magnetic  field. 

Using  the  stray  electric  field  in  the  old  apparatus, 
we  have  studied  the  Stark  splitting  of  the  one¬ 
dimensional  hydrogen  atom  that  is  approximated  by 
magnetic  confinement  of  the  electron  transverse  to 


the  magnetic  field.  The  measured  field  is  in  rea¬ 
sonable  agreement  with  the  splitting  predicted  by 
the  one-dimensional  model.  However,  it  is  more 
desirable  to  measure  the  Stark  splitting  at  a  variety 
of  electric  fields  by  using  the  electric  field  as  a  con¬ 
trolled  probe  to  check  how  closely  our  system 
resembles  one-dimensional  Hydrogen.  The  new 
magnet  and  interaction  region  allow  this  to  be  done. 

Using  a  program  written  by  Dominique  Delande  of 
E.N.S.  Paris,  we  have  calculated  the  spectra  in 
many  regions  below  the  ionization  limit.  One  result 
of  these  calculations  suggests  that  there  are  energy 
levels  which  correspond  to  wavefunctions  localized 
near  the  z=0  plane.  These  levels  can  be  con¬ 
nected  with  eigenvalues  of  a  simple  one  dimen¬ 
sional  model,  and  they  produce  the  smallest 
anticrossings  in  the  system.  This  suggests  yet 
another  undiscovered  approximate  symmetry.  It 
also  shows  that  in  spite  of  the  chaotic  nature  of  the 
corresponding  classical  motion,  the  trivial  periodic 
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orbits  in  the  z=0  plane  have  a  strong  influence  on 
the  quantum  spectrum.  This  has  been  shown  pre¬ 
viously  in  the  quasi-Landau  modulations  (periodic 
modulations  in  the  oscillator  strength  of  1.5  times 
the  cyclotron  frequency®),  but  ours  is  the  first  work 
connecting  these  orbits  with  individual  energy 
levels. 

There  is  a  growing  interest  in  periodic  orbit  spec¬ 
troscopy.  Periodic  orbit  spectroscopy  is  a  tech¬ 
nique  that  relates  the  quantum  mechanical 
spectrum  to  the  periodic  orbits  of  the  corresponding 
classical  system.®  Classically  chaotic  systems 
possess  a  proliferation  of  periodic  orbits.  The  clas¬ 


sical  dynamics  of  diamagnetic  hydrogen  follow  a 
simple  scaling  rule,  and  the  Fourier  transform  of  the 
constant-scaled-energy  spectrum  gives  the  scaled 
classical  actions  of  periodic  orbits.  This  has  been 
shown  experimentally,'®  but  many  peaks  in  the 
Fourier  transform  were  unresolved.  A  better  tech¬ 
nique  is  needed  to  study  quantum  processes  corre¬ 
sponding  to  classical  bifurcations  of  periodic  orbits. 
Classical  bifurcation  theory  describes  the  manner  in 
which  new  periodic  orbits  can  be  created  out  of  old 
ones  or  "out  of  nowhere"  as  the  energy  changes. 

We  have  shown  that  the  results  of  Delande's 
program  can  be  used  to  calculate  the  constant- 


B  J.C.  Castro,  M.L.  Zimmerman,  R.G.  Hulet,  and  D.  Kleppner.  "Origin  and  Structure  of  the  Quasi-Landau  Resonances,"  Phys.  Rev. 
Lett.  45:  1780  (1980). 

9  A.  Holle,  J.  Main,  G.  Wiebusch,  H.  Rottke,  and  K.H.  Welge,  "Quasi-Landau  Spectrum  of  the  Chaotic  Diamagnetic  Hydrogen  Atom," 
Phys.  Rev.  Lett.  61;  161  (1988);  J.-M.  Mao  and  J.B.  Delos,  "Hamiltonian  Bifurcation  Theory  of  Closed  Orbits  in  the  Diamagnetic 
Kepler  Problem,"  Phys.  Rev.  A.  45:  1746  (1992). 

’0  A.  Holle,  J.  Main,  G.  Wiebusch.  H.  Rottke,  and  K.H.  Welge,  "Quasi-Landau  Spectrum  of  the  Chaotic  Diamagnetic  Hydrogen  Atom," 
Phys.  Rev.  Lett.  61:  161  (1988). 
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scaled-energy  spectrum.  One  constant-scaled- 
energy  spectrum  and  a  series  of  Fourier  transforms 
is  shown  in  Figure  5.  Each  peak  in  the  Fourier 
transform  corresponds  to  the  scaled  classical  action 
of  a  periodic  orbit.  These  results  are  currently 
being  compared  with  the  semi-classical  results  of 
J.B.  Delos. 


3  4 

(Atomic  Units*'*^'] 


Figure  5.  a)  Constant-scaled-energy  spectrum  for  a 
scaled  energy  of  -0.3.  b)  Fourier  transforms.  Each  peak 
corresponds  to  a  periodic  orbit  of  the  classical  system. 


2.3  Precision  Mass  Spectroscopy  of 
Ions 
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In  1992,  we  undertook  a  program  of  measurement 
of  the  atomic  masses  of  some  key  atoms  important 
for  the  improvement  of  the  nuclear  mass  table. 
Through  major  improvements  in  our  experiment,  we 
have  increased  our  precision  by  a  factor  of  5  to  less 
than  10-’°  and  performed  several  checks  on  our 
sources  of  error.  This  capability  will  allow  us  to  do 
a  variety  of  experiments  in  both  fundamental  and 
applied  physics,  including: 

•  The  °H*-°He*  mass  difference,  important  in 
ongoing  experiments  to  determine  the  electron 
neutrino  rest  mass; 

•  Determination  of  Avogadro’s  number  Na,  by 
weighing  Y-'’ays — and  combined  with  the 
Rydberg,  gives  an  independent  value  of  a; 
Determing  the  Avogadro's  number  accurately 
would  permit  the  replacement  of  the  "artifact" 
mass  standard  by  an  atomic  mass  standard. 

•  Improvement  of  many  traditional  applications  of 
mass  spectroscopy  by  orders  of  magnitude 
improvement  in  both  accuracy  and  sensitivity 
and, 

•  Determination  of  excitation  and  binding  ener¬ 
gies  of  atomic  and  molecular  ions  by  weighing 
the  small  decrease  in  energy.  Am  =  Et^nd/c^  (we 
must  reach  our  ultimate  goal  of  a  few  *  10  to 
make  this  a  generally  useful  technique). 

Our  experimental  approach  is  to  measure  ion 
cyclotron  resonance  on  a  single  molecular  or 
atomic  ion  in  a  Penning  trap,  a  highly  uniform  mag¬ 
netic  field  with  axial  confinement  provided  by 
weaker  electric  fields.  We  monitor  the  ion's  oscil¬ 
lation  along  the  magnetic  field  lines  by  detecting  the 
currents  induced  in  the  trap  electrodes.  Working 
with  only  a  single  ion  is  essential  because  space 
charge  from  other  ions  leads  to  undesired  fre¬ 
quency  shifts.  This  work  in  trapping  and  precision 
resonance  draws  on  techniques  developed  by  Hans 
Dehmelt  at  the  University  of  Washington  and 
Norman  Ramsey  at  Harvard  University,  for  which 
they  shared  the  Nobel  Prize  in  1 989. 

We  have  developed  techniques  for  driving,  cooling, 
and  measuring  the  frequencies  of  all  three  normal 
modes  of  Penning  trap  motion.  Thus,  we  can 
reproduce  our  manipulation  of  the  ion  position  to 
within  30  microns  of  the  center  of  the  trap,  cor¬ 
recting  for  electrostatic  shifts  in  the  cyclotron  fre¬ 
quency  to  great  accuracy.  We  use  a  n-pulse 
method  to  coherently  swap  the  phase  and  action  of 
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the  cyclotron  and  axial  modes."  Therefore, 
although  we  detect  only  the  axial  motion  directly, 
we  can  determine  cyclotron  frequency  by  meas¬ 
uring  the  phase  accumulated  in  the  cyclotron 
motion  in  a  known  time  interval  (figures  6a  and  6b). 

In  the  past  two  years  we  have  rebuilt  our  apparatus 
with  a  new  Penning  trap  and  quieter  rf  SQUID 
detector.  We  have  implemented  a  new  signal  pro¬ 
cessing  algorithm  to  improve  our  phase  estimation 
by  a  factor  of  2-3,  resulting  in  a  precision  of 
1x10-'°  for  a  one-minute  measurement.  Our 
entire  ion-making  process  has  been  automated,  and 
the  computer  can  cycle  from  an  empty  trap  to 
having  a  cooled  single  ion  in  about  3  minutes.  This 
has  been  partly  due  to  a  new  broadband  white 
noise  generator  that  is  used  to  expel  unwanted 
ions. In  addition,  we  have  built  an  external  ion 
source  and  associated  optics  that  can  produce  and 
select  ions  with  a  resolution  of  0.1.  When  this 
comes  on  line,  we  should  be  able  to  greatly  reduce 
the  percentage  of  "bad"  ions  and  also  alleviate  the 
problem  of  residual  neutral  gas  when  using  volatile 
species  such  as  helium. 

We  have  also  developed  a  new  measurement  tech¬ 
nique  that  enables  us  to  extend  our  precision  com¬ 
parison  to  non-doublets  of  mass  (actually  mass  to 
charge  ratio).  This  has  allowed  us  to  perform  strin¬ 
gent  checks  on  systematics  using  such  known 
ratios  as  Nj/N*  and  ArVAr**.  This  represents  a  sig¬ 
nificant  advance  in  precision  mass  spectrometry 
since  it  allows  us  to  obtain  absolute  masses  by 
direct  comparison  to  carbon.  Indeed,  we  have  com¬ 
pared  CDa  and  CDj  to  C  to  obtain  two  determi¬ 
nations  of  the  absolute  mass  of  deuterium.  Mass 
ratios  of  doublets  cannot  be  inverted  to  obtain 
absolute  masses  in  general  without  comparing 
some  species  to  carbon  (C6H,4/C7H2  to  obtain  H  or 
C3D8/C4D2  to  obtain  D,  for  example).  But  we  have 
developed  a  scheme  involving  only  doublet  compar¬ 
isons  that  uses  the  known  ArVAr**  ratio  to  obtain 
the  absolute  masses  of  several  light  atoms — H,  D, 
N,  O,  Ne,  Ar— to  about  an  order  of  magnitude 
better  than  the  current  standard  mass  table.  The 
value  of  the  deuterium  mass  obtained  using  this 
method  is  consistent  with  the  value  from  the  direct 
non-doublet  comparisons,  giving  us  confidence  in 
our  error  estimates.  With  H  and  D  thus  known,  we 
can  obtain  the  mass  of  any  ions  by  comparison  with 
a  suitable  organic  compound  of  the  same  nominal 
mass. 


With  the  field  imperfections  (both  magnetic  and 
electric)  shimmed  as  well  as  we  can,  our  systematic 
errors  are  well  below  5  x  10  ".  In  one  night  of  quiet 
magnetic  time,  we  can  get  between  20  and  30 
switches  of  the  two  ion  species.  The  lowest  statis¬ 
tical  noise  we  have  obtained  thus  far  is  2.4  x  10-’° 
per  shot  (over  the  smooth  drift  in  the  field)  and 
6x  10'"  for  the  night  (see  figure  6).  We  feel  that 
this  precision  is  perhaps  the  limit  with  the  alternate 
ion  scheme  that  we  use  now,  but  this  is  already 
quite  a  useful  precision  for  the  °H-°He  and  the 
KlAh  measurements  described  above.  We  plan  to 
improve  our  precision  either  by  shielding  field  fluctu- 


Figure  6.  (a)  Cyclotron  frequency  as  a  function  of  time 
for  alternate  CDa  and  Ar-  ions  in  our  Penning  trap.  The 
frequencies  are  obtained  after  a  50s  integration  of 
cyclotron  phase  (see  text).  The  solid  line  is  a  polynomial 
fit  to  the  drift  in  the  field  common  to  both  ions,  (b)  A 
histogram  of  the  individual  ratios  obtained  between 
neighbouring  points.  Solid  line  is  a  gaussian  distribution 
with  a  standard  deviation  of  2.4  x  10  ’°. 


”  E.A.  Cornell,  R.M.  Weisskoff,  K.R.  Boyce,  and  D.E.  Pritchard,  "Mode  Coupling  in  a  Penning  Trap:  ;i  and  a  Classical  Avoided 
Crossing,”  Phys.  Rev.  A  41:  312  (1990), 

'2  K.R.  Boyce,  Improved  Single  Ion  Cyclotron  Resonance  Mass  Spectroscopy,  Ph.D,  diss..  Dept,  of  Physics,  MIT,  1992. 
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ations  better  or  having  two  ions  in  the  trap  at  the 
same  time.'^ 

If  we  can  get  the  shot  to  shot  fluctuation  below 
1  X  10  ’°,  the  primary  source  of  measurement  noise 
will  be  the  special  relativistic  mass  shift  due  to 
thermal  fluctuations  in  cyclotron  amplitude.  We 
have  proposed  a  scheme  of  classical  squeezing 
with  parametric  drives  to  reduce  amplitude  fluctu¬ 
ations.’*  We  hope  to  demonstrate  its  ability  in 
reducing  these  fluctuations  by  a  factor  of  3  to  5. 
These  techniques  should  help  us  in  reaching  our 
ultimate  goal  in  precision  of  a  few  parts  in  10'*, 
necessary  for  weighing  chemical  bonds. 

Publications 

Boyce,  K.R.  Improved  Single  Ion  Cyclotron  Reso¬ 
nance  Mass  Spectroscopy.  Ph.D.  diss.  Dept,  of 
Physics,  MIT,  1992. 

Cornell,  E.A.,  K.R.  Boyce,  D.L.K.  Fygenson  and 
D.E.  Pritchard.  Two  Ions  in  a  Penning  Trap: 
Implications  for  Precision  Mass  Spectroscopy.” 
Phys.  Rev.  A  45:  3049-3059  (1992). 

DiFilippo,  F.,  V.  Natarajan,  K.R.  Boyce,  and  D.E. 
Pritchard.  "Classical  Amplitude  Squeezing  for 
Precision  Measurements."  Phys.  Rev.  Lett.  68: 
2859-2862  (1992). 


2.4  Atom  Interferometry 

Project  Staff 

John  E.  Berberian,  Michael  S.  Chapman, 
Christopher  R.  Ekstrom,  Troy  D.  Hammond,  Eliot  J. 
Quataert,  Dr.  H.  Joerg  Schmiedmayer,  Bridget  E. 
Tannian,  Professor  David  E.  Pritchard 

Sponsors 

Joint  Services  Electronics  Program 


Contract  DAAL03-92-C-0001 
U.S.  Army  -  Office  of  Scientific  Research 
Contract  DAAL03-89-K-0082 
U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1207 

During  1992,  we  refined  our  atom  interferometer’® 
and  began  performing  experiments  with  spatially 
separated  beams.  The  interferometer  is  now  oper¬ 
ating  with  smaller  period  gratings,’®  providing 
greater  beam  separation.  The  experiments  were 
performed  with  the  aid  of  an  interaction  region  that 
inserts  a  thin  metal  foil  between  the  beams.  This 
allowed  us  to  manipulate  the  atomic  wave  function 
in  only  one  arm  of  the  interferometer. 

Atom  interferometers  will  make  possible  quali¬ 
tatively  new  types  of  experiments  involving  inertial 
effects,  studies  of  atomic  and  molecular  properties, 
and  tests  of  fundamental  physics.  In  addition,  they 
may  ultimately  open  the  way  for  making  ultra-small 
structures  using  atom  holograms. 

•  The  relatively  large  mass  and  low  velocity  of 
atoms  make  atom  interferometers  especially 
sensitive  to  inertial  effects  such  as  rotation, 
acceleration,  and  gravity.  Sagnac  rotation  has 
been  observed  in  accord  with  theoretical  pre¬ 
dictions’*  and  sensitivity  to  gravitational  accel¬ 
eration  at  the  3x10  ®  level  has  been 
demonstrated.’®  Atom  interferometers  may 
become  the  best  absolute  accelerometers  and 
gravimeters  in  the  next  few  years. 

•  Atom  interferometers  can  be  applied  to  a 
number  of  experiments  in  fundamental  physics: 
tests  of  quantum  mechanics  such  as  the 
Aharonov-Casher  effect,’*  geometric  phases 
and  the  measurement  process,  measurement 
of  the  equality  of  proton  and  electron  charges, 
and  a  precise  measurement  of  the  momentum 
of  a  photon.  This  latter  measurement  should 
produce  a  new  high  precision  value  for  the  fun¬ 
damental  constants  Na^. 


13  E.A.  Cornell,  K.R.  Boyce,  D.L.K.  Fygenson  and  D.E.  Pritchard.  Two  Ions  in  a  Penning  Trap;  Implications  for  Precision  Mass  Spec¬ 
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•  Interferometers  for  atoms  and  molecules  will 
offer  more  accurate  ways  to  measure  intrinsic 
properties  of  these  particles,  like  their  polariza¬ 
bility.  They  will  also  open  up  new  areas  of 
study,  such  as  measurements  of  the  "index  of 
refraction"  of  a  gas  for  a  particle  beam  which 
passes  through  it. 

The  key  component  of  our  interferometer  is  the  set 
of  three  matched  transmission  diffraction  gratings 
which  we  constructed  at  the  National  Nanofabrica¬ 
tion  Facility  (NNF)  at  Cornell  University.  The 
process  allows  fabrication  of  precisely  positioned 
openings  in  thin  silicon  nitride  membranes  mounted 
in  silicon  frames.  The  pattern  created  in  the  mem¬ 
brane  is  determined  by  an  electron  beam  writer, 
making  the  process  quite  versatile.  This  process 
was  used  to  create  a  variety  of  diffraction  gratings 
used  in  the  interferometer.  In  addition,  several 
zone  plates  (atom  lenses)  were  also  built,  and  were 
later  successfully  demonstrated.'® 

Our  present  interferometer  consists  of  three  200  nm 
period  transmission  gratings,  mounted  0.66  m  apart 
on  separate  translation  stages  inside  the  vacuum 
envelope.  During  operation,  the  O'"  and  order 
beams  from  the  first  grating  strike  the  middle 

grating  (which  is  140  pm  wide)  where  they  are 
diffracted  in  the  1*'  and  -1®'  orders  so  that  they 
converge  at  the  third  grating.  At  the  second 

(middle)  grating  the  beams  have  widths  of  30  pm 
(FWHM)  and  are  separated  by  55  pm.  The  first 

two  gratings  form  an  interference  pattern  in  the 

plane  of  the  third  grating,  which  acts  as  a  mask  to 
sample  this  pattern.  The  detector,  located  0.30  m 
beyond  the  third  grating,  records  the  flux  trans¬ 
mitted  by  the  third  grating.  An  interaction  region, 
consisting  of  a  stretched  metal  foil  positioned  sym¬ 
metrically  between  two  side  electrodes,  is  inserted 
in  the  interferometer  so  that  the  atom  wave  in  the 
two  sides  of  the  interferometer  went  on  opposite 
sides  of  the  foil.  The  foil  was  10  cm  long  and  10 
microns  thick  and  the  gap  between  the  foil  and 
each  electrode,  where  the  separated  atom  beams 
traveled,  was  2  mm.  Figure  7  shows  the  design  of 
the  interferometer. 

The  data  necessary  to  determine  the  interferometer 
phase  and  contrast  are  acquired  by  modulating  the 
position  of  one  grating  relative  to  the  other  two  and 
simultaneously  recording  the  signal  from  the  atom 
counting  electronics  as  well  as  the  signal  from  an 
optical  interferometer  used  to  measure  the  relative 
position  of  the  gratings.  After  removing  data 
obscured  by  noise  spikes  from  the  hot  wire,  the 


atom  count  rate  data  are  averaged  into  bins 
according  to  relative  grating  position,  resulting  in 
the  fringe  pattern  shown  in  figure  8.  The  peak  to 
peak  amplitude  of  our  interference  signal  is  1600 
Hz,  which  enables  us  to  determine  the  interferom¬ 
eter  phase  to  a  precision  of  15  milliradians  in  1 
minute. 

By  putting  an  electric  field  on  one  side  of  the  inter¬ 
action  region,  the  interference  pattern  is  shifted. 
This  phase  shift  is  caused  by  the  DC  Stark  shift  of 
the  atom.  The  Stark  shift  is  ■aE^/2  ,  where  a  is  the 
electric  polarizability  and  E  is  the  electric  field.  The 
measured  phase  shift  was  quadratic  with  the 
applied  field,  allowing  us  to  determine  the  polariza¬ 
bility  of  the  ground  state.  Figure  9  shows  the 
phase  shifts  versus  applied  electric  field.  By 
observing  the  reduction  of  the  interference  contrast 
with  increasing  phase  shift,  we  can  measure  the 
longitudinal  coherence  length  of  the  atomic  beam. 
Our  coherence  length  is  1.6  A,  consistent  with  the 
measured  velocity  distribution. 

In  another  experiment,  we  apply  a  uniform  magnetic 
field  along  the  beam  axis  to  determine  the  quantiza¬ 
tion  direction.  By  running  a  current  down  the  metal 
foil,  perpendicular  to  the  plane  of  the  interferometer, 
we  increase  the  field  magnitude  on  one  side  of  the 
interaction  region  (thus  on  one  beam)  and  decrease 
it  on  the  other.  This  gives  a  differential  Zeeman 
energy,  and  therefore  phase,  for  the  two  paths  that 
is  proportional  to  the  septum  current  and  the 
projection  of  the  magnetic  moment  along  the  beam 
axis. 

The  interference  pattern  of  each  of  the  eight  sodium 
ground  states  shifts  independently.  There  are  five 
different  values  of  the  angular  momentum  projection 
(one  each  m?  ±2,  two  each  mF  =  i:1,0).  The  dif¬ 
ferent  interference  patterns  superpose  to  produce 
an  interference  pattern  whose  contrast  depends  on 
the  differential  Zeeman  shifts.  This  is  shown  in 
figure  10.  The  first  revival  of  contrast  is  the  point 
where  the  phase  shifts  are:  ±4n  for  the  1  mp  1  =  2 
states,  *2k  for  the  I  mp  I  -  1  states,  and  0  for  the 
mp  =  0  states. 

The  scientific  future  of  atoms  interferometers  looks 
bright:  atom  beam  sources  are  inexpensive  and 
intense  relative  to  other  particle  beams/sources 
(e.g.,  neutrons,  electrons);  several  techniques  have 
now  been  demonstrated  to  make  interferometers  for 
them;  and  the  atoms  which  may  be  used  in  them 
come  with  a  wide  range  of  parameters  such  as 
polarizability,  mass,  and  magnetic  moment.  This 
assures  the  applicability  of  these  instruments  to  a 


’9  C.R.  Ekstrom,  D.W.  Keith,  and  D.E.  Pritchard,  "Atom  Optics  Using  Microfabricated  Structures,"  App/.  Phys.  B  54:  369  (1992). 


164  RLE  Progress  Report  Number  135 


Chapter  2.  Basic  Atomic  Physics 


wide  range  of  measurements  of  both  fundamental 
and  practical  interest. 
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Figure  7.  A  schematic,  not  to  scale,  of  our  atom  interferometer  showing  the  active  vibration  isolation  system  and  the 
interaction  region.  The  10  urn  copper  foil  is  shown  between  the  two  arms  of  the  interferometer  (thick  lines  are  atom 
beams).  The  optical  interferometer  (thin  lines  are  He-Ne  beams)  is  used  for  active  vibration  isolation.  The  200-nm 
period  atom  gratings  are  indicated  by  a  vertical  dashed  line,  and  the  3.3-nm  period  optical  gratings  by  a  vertical  solid 
line. 
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position  in  nm 


Figure  8.  Interference  pattern  from  40  seconds  of  data. 
The  interference  signal  is  860  counts  per  second  with  a 
contrast  of  25%,  which  allows  us  to  determine  the  phase 
to  15  milliradians  in  1  minute.  A  background  of  120 
counts  per  second  has  been  subtracted. 


Figure  9.  Stark  phase  shifts  for  voltages  applied  to  the 
right  (open  circles)  and  the  left  (filled  circles)  sides  of  the 
interaction  region.  Phase  shift  per  applied  electric  field  in 
(volt/cm)2  is  1 .220(7)  X  10'*  for  the  left  side  and 
1.224(7)  X  10  ®  for  the  right  side.  This  measurement  sta¬ 
tistically  determines  the  dc  polarizability  of  sodium  to 
0.4%. 
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Figure  10.  Contrast  revival  from  the  independent 
precession  of  the  8  different  internal  ground  states  of 
sodium.  A  current  flows  down  the  septum,  altering  the 
size  of  an  axial  magnetic  guide. 
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Our  current  objective  is  to  obtam  samples  of  atoms 
at  very  high  density  and  ultra-low  temperatures. 
This  goal  is  pursued  by  using  a  high  flux  slower  for 
atoms,  a  dark  light  trap  to  stop  and  compress  the 
atoms,  and  a  magnetic  trap  for  the  final  confine¬ 
ment  and  cooling. 

Experiments  with  dense  samples  of  cold  neutral 
atoms  promise  exciting  new  discoveries  in  basic 
and  applied  physics.  Due  to  the  considerably 
reduced  thermal  motion  of  atoms,  the/  are  ideal  for 
high  resolution  spectroscopy  and  for  more  accurate 
atomic  frequency  standards. 

•  Collisions  of  ultra-cold  atoms  in  such  samples 
are  characterized  by  a  long  deBroglie  wave¬ 
length  and  are  dominated  by  weak  long-range 
interactions.  Since  the  collision  duration  for 
slow  atoms  greatly  exceeds  the  radiative  decay 
time,  stimulated  and  spontaneous  radiative 
transitions  can  take  place  during  the  collision. 
Slow  collisions  are  therefore  radically  different 
from  fast  collisions  studied  so  far  and  will 
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become  an  exciting  new  field  of  atomic 
physics.^° 

•  High  density  samples  of  atoms  open  possibil¬ 
ities  for  observing  quantum  collective  effects 
such  as  Bose-Einstein  condensation  and  col¬ 
lectively  enhanced  or  suppressed  radiative 
decay. 

In  1 992,  we  completed  work  on  an  inverse  Zeeman 
slower,  which  produced  the  highest  flux  of  cold 
atoms  obtained  to  date.  We  demonstrated  that 
further  enhancement  is  possible  by  transverse 
collimation  of  the  slow  atom  beam  inside  the 
slower.  The  slow  atoms  were  captured  in  a  novel 
light  trap,  a  "dark  SPOT"  (dark  spontaneous  force 
optical  trap),  which  enabled  us  to  confine  more  than 
10'°  atoms  at  densities  one  to  two  orders  of  magni¬ 
tude  higher  than  achieved  to  date.  In  addition,  two 
theoretical  papers  were  published  on  possibilities 
(and  impossibilities)  of  trapping  and  cooling  atoms 
by  external  fields. 

Inverse  Zeeman  Slower 

In  a  Zeeman  slower  the  changing  Doppler  shift  as 
an  atom  slows  is  compensated  by  the  Zeeman  shift 
in  an  inhomogeneous  magnetic  field.  This  method 
has  the  advantage  of  producing  a  continuous  beam 
of  slow  atoms  and  has  a  practically  unlimited 
velocity  capture  range.  In  the  original  implementa¬ 
tion  of  Zeeman  slowing,  however,  one  encountered 
difficulties  in  producing  beams  of  atoms  with  veloci¬ 
ties  lower  than  200  m/s.  The  major  problems  were 
off-resonant  slowing  of  atoms  after  they  left  the 
slower  and  substantial  spreading  of  the  slow  atomic 
beam  due  to  transverse  heating. 

To  solve  the  first  problem  we  chose  the  recently 
demonstrated  inverse  Zeeman  slower  which  slows 
atoms  using  o  -  light.^’  This  type  of  slower  requires 
a  magnetic  field  increasing  along  the  atom's  trajec¬ 
tory  until  the  atoms  are  slowed  to  the  desired  final 
velocity.  The  magnetic  field  drops  off  rapidly  after 
reaching  this  maximum,  thus  quickly  shifting  atoms 
out  of  resonance  and  reducing  off-resonant  slowing 
(figure  11).  The  atomic  densities  in  the  slow  beam 
were  high  enough  so  that  the  longitudinal  velocity 
distributions  could  be  determined  directly  from 


Figure  11.  Experimental  setup  for  the  inverse  Zeeman 
slower:  A  beam  of  sodium  atoms  is  slowed  by  counter- 
propagating  laser  light  inside  a  solenoid  with  a 
parabolically  increasing  magnetic  field.  The  slow  atomic 
beam  is  collimated  and  deflected  inside  the  slower  by 
transverse  two-dimensional  optical  molasses. 

absorption  spectra  for  the  first  time.  This  allowed 
the  unambiguous  measurement  of  absolute 
flux — values  of  2  x  10"  atoms/sec  at  40  m/sec  and 
10'2  atoms/sec  at  100  m/sec  through  a  4  cm^  cross 
section  were  obtained.  The  observed  drop  in  flux 
for  small  velocities  showed  an  approximate  v^ 
dependence,  consistent  with  spreading  of  the  slow 
beam  due  to  transverse  heating  during  the  slowing 
process. 

The  problem  of  transverse  spreading  of  the  slow 
atoms  has  been  reduced  by  application  of  trans¬ 
verse  cooling  with  red  molasses.  This  was  applied 
inside  the  slower,  at  the  stationary  field  point  where 
the  atoms  have  essentially  completed  their  slowing, 
but  have  not  yet  had  time  to  spread  transversely. 
The  transverse  velocities  w  re  reduced  from  -  2.5 
m/s  to  -  1  m/s,  resulting  in  a  sixfold  increase  in  the 
density  of  the  slow  atoms.  Furthermore,  deflection 
of  the  slow  atomic  beam  by  imbalancing  the 
molasses  beams  resulted  in  complete  separation  of 
the  slowed  beam  from  both  the  original  thermal 
beam  and  the  slowing  laser  beam;  this  is  important 
for  experiments  requiring  slow  atoms  unperturbed 
by  laser  beams  or  collisions  with  thermal  atoms. 
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A  Dark  Light  Trap  for  Neutral  Atoms 

The  most  common  method  of  obtaining  high  density 
samples  of  cold  atoms  is  to  use  a  magneto-optical 
trap  (MOT)  to  collect,  cool  and  confine  the  atoms. 
However,  the  maximum  density  achieved  in  a  MOT 
has  been  limited  so  far  to  - 10"  atoms/cm^  by  two 
processes:  (1)  trap  loss  due  to  collisions  between 
ground-  and  excited-state  atoms, *2  and  (2)  repulsive 
forces  between  the  atoms  caused  by  reabsorption 
of  scattered  photons  (radiation  trapping).^^ 

We  have  demonstrated  a  dark  SPontaneous-force 
Optical  Trap  ("dark  SPOT"),  in  which  all  the  above- 
mentioned  limitations  are  mitigated  by  confining  the 
atoms  mainly  in  a  ("dark")  hyperfine  ground  state 
which  does  not  interact  with  the  trapping  light.  The 
key  idea  is  "shelving"  atoms,  i.e.,  cooling  and  trap¬ 
ping  forces  are  only  exerted  on  the  small  fraction  of 
atoms  in  the  "bright"  state.  In  such  a  trap,  more 
than  10’°  sodium  atoms  have  been  confined  to  den¬ 
sities  approaching  10'^  atoms  cm-°  (figure  12).  A 
similar  number  of  atoms  have  been  previously 
trapped  by  light  forces  only  at  densities  20  times 
lower. 

The  high  densities  achieved  in  a  dark  SPOT  are 
promising  for  the  observation  of  evaporative  cooling 
after  transferring  the  atoms  into  a  magnetic  trap.  At 
densities  of  10'^  cm-°,  the  estimated  elastic  colli¬ 
sion  rate  is  already  100  s-',  much  larger  than  the 
trap  loss  rate  due  to  collisions  with  the  background 
gas. 

Trapping  and  Cooling  Atoms  by  External 
Fields 

In  a  theoretical  paper  we  have  studied  possibilities 
of  using  electric,  magnetic  and  gravitational  fields  to 
trap  atoms  in  the  ground  state.  A  trap  for  particles 
in  the  lowest  internal  state  is  highly  desirable  since 
loss  by  two-body  collision  is  endothermic  and  thus 
suppressed.  However,  we  were  able  to  prove  that 
it  is  impossible  to  trap  ground  state  particles  at  rest 
using  arbitrary  combinations  of  electric,  magnetic, 
and  gravitational  fields,  a  result  which  is  a  consider¬ 
able  generalization  of  a  previous  theorem.  Confine¬ 
ment  of  ground  state  particles  is  possible  if  they  are 
in  motion  (dynamic  equilibrium).  Our  analysis 
showed  that  axially  symmetric  storage  rings  with 
electric  or  magnetic  fields  are  possible  and  should 
be  experimentally  feasible. 


Figure  12.  Absorption  spectrum  of  a  4-mm  diameter 
cloud  of  sodium  atoms  trapped  in  a  dark  SPOT.  The 
best  fit  yields  an  optical  density  (OD)  of  1 35  which  corre¬ 
sponds  to  an  atomic  density  of  7  x  10"  cm-2  and 
-5x10’“  trapped  atoms.  Traces  with  lower  OD  were 
recorded  with  a  reduced  number  of  atoms.  The  dashed 
lines  are  calculated  spectra  for  OD  =  1,  10,  120,  135,  and 
150,  respectively. 

Recently,  cooling  schemes  for  atoms  have  been 
suggested  which  rely  solely  on  time-dependent  con¬ 
servative  forces.  We  were  able  to  devise  a 
theorem  showing  that  time-dependent  potentials 
cannot  increase  the  phase-space  density  of  atoms, 
even  if  the  motion  is  nonclassical.  "Real  cooling"  or 
"brightening"  of  atoms  is  only  possible  if  the  time 
evolution  in  non-unitary,  e.g.,  by  spontaneous  emis¬ 
sion  of  photons  or  loss  of  atoms  (evaporative 
cooling). 
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3.1  Introduction 

Sponsor 

F.L.  Friedman  Chair 

Determining  whether  the  magnitude  of  the  charge 
of  a  proton  differs  from  that  of  an  electron  or 
whether  neutrons,  neutrinos,  or  photons  are 
charged  presents  a  significant  and  fundamental 
problem.  Verifying  the  existence  of  these  charge 
differences  would  be  very  important  to  our  funda¬ 
mental  understanding  of  (1)  electric  charge  quanti¬ 
zation  and  conservation  and  (2)  the  standard  model 
and  grand  unified  theories. 

Because  these  hypothetical  small  charges  or  differ¬ 
ences  in  charges  have  no  observable  effect  on  the 
world,  we  could  only  verify  them  using  specially 
designed  experiments.  Since  there  is  no  definite 
theoretical  prediction  for  these  hypothetical  small 
charges,  we  simply  carry  out  improved  bench-top 
experiments  that  are  feasible  with  our  limited 
resources.  (This  is  in  contrast  to  studies  of  neutron 
dipole  moment,  proton  lifetime,  missing  neutrinos, 
etc.,  which  justified  a  large  scale  effort  and  commit¬ 
ment  of  resources.) 

Experiments  dating  back  70  years  have  virtually 
guaranteed  that  these  charges  or  charge  differ¬ 
ences  are  less  in  magnitude  than  10- of  the  fun¬ 
damental  electron  charge,  1.6  x  10-'®  C.  So  far  the 
results  of  our  experiments  have  further  reduced  the 
limit  of  magnitude  difference  of  proton-electron 
charge  and  the  neutron  charge  by  three  orders  of 
magnitude. 


3.2  Project  Description 

The  electric  charge  of  any  object,  material,  or 
system  is  measured  by  bringing  a  suitable  sample 
near  other  charges  or  other  charges  near  it  so  that 
electric  forces  would  produce  obsen/able  effects. 
The  materials  or  systems  that  have  been  studied  or 
are  currently  being  studied  by  various  groups 
around  the  world  include:  (1)  beams  of  single 
atoms,  which  if  not  neutral,  would  experience  a 
velocity  change  in  an  electric  field  and  be  deflected, 
change  deBroglie  wavelength,  or  interact  with 
Doppler-shifted  light;  (2)  small  aggregates,  like 
Millikan  oil  drops,  which  if  not  neutral,  would  be 
deflected  by  electric  fields;  (3)  bulk  fluids  from 
which  ordinary  charges  (ions  or  electrons)  can  be 
removed.  This  includes  many  gases  and  superfluid 
helium,  which  would  transfer  charge  when  moved 
by  flow  whether  at  low  frequency,  when  simply  let 
out  of  an  electrically  insulated  container,  or,  in  the 
case  of  liquid  He  II,  sloshing  in  a  U-tube,  or  at 
higher  frequency  such  as  in  a  standing  sound 
wave;  and  (4)  the  inverse  of  the  last  method— an 
applied  alternating  electric  field  would  induce  sound 
waves  in  the  material  if  it  is  not  neutral. 

While  many  of  these  methods  suffer  from  difficulty 
with  controlling,  reversing,  and  otherwise  character¬ 
izing  applied  electric  fields,  the  gas  efflux  method 
we  are  using  has  problems  with  flow-induced 
charge  separation,  drift  and  noise  problems  of 
essentially  dc  charge  measurements  in  the 
attoampere  range,  and  comparatively  small 
throughput  of  molecules  (about  lO^'/second).  The 
pulsed  flow  method  would  raise  the  throughput  by  a 
factor  of  500,  and  the  fact  that  the  signal  sought  is 
at  1  kHz,  i.e.,  above  the  i/f-noise  range,  should 
make  long  term  correlation  feasible  to  yield  an  addi¬ 
tional  factor  of  1 00. 

In  the  gas  efflux  method  a  siren-like  chopper 
releases  0.5  ms  gas  pulses  at  a  one  kHz  rate  into  a 
chamber  where  a  background  pressure  of  the  same 
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gas  is  maintained  by  suitable  pumps.  This  pres¬ 
sure  must  be  such  that  the  pulses  travel  a  substan¬ 
tial  distance  with  minimum  diffusion  neither  stopping 
from  viscous  retardation  nor  coming  apart  by  free 
molecule  flow.  We  calculate  that  0.5  kPa  is  approx¬ 
imately  the  correct  pressure.  These  pulses,  if 
charged,  induce  charge  on  cylindrical  electrodes 
through  which  they  travel.  The  resulting  potential 
changes  can  be  detected  with  a  suitable  electro¬ 


meter,  which  might  take  the  form  of  an  array  of  par¬ 
alleled,  cooled  FETs  to  take  advantage  of 
Shannon's  theorem. 

At  present,  we  are  using  a  crude  apparatus  assem¬ 
bled  from  pipe  fittings  with  a  pneumatically  c. aerated 
shutter  to  investigate  the  dynamics  of  single  pulses 
in  background  gas  in  order  to  design  the  pumping 
system  needed  to  handle  1000  of  these  pulses  per 
second. 
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1.1.1  Frequency  Shifting  Phenomena  In 
Free  Electron  Lasers  -  Experiment 

Frequency  upshifting  of  an  electromagnetic  wave 
incident  on  a  rapidly  time  varying  plasma  medium  is 
of  considerable  interest  recently.  Such  upshifting 
can  occur  as  a  result  of  a  time-dependent  dielectric 
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coefficient  caused  by  changes  in  plasma  density,^ 
or  as  a  result  of  a  rapidly  moving  plasma-vacuum 
boundary/  Prior  studies  have  been  confined  to 
cases  in  which  the  dielectric  is  essentially  a  passive 
medium;  that  is,  the  amplitude  of  the  wave  propa¬ 
gating  through  it  is  constant  in  time  and  space.  In 
this  paper  we  report  what  we  believe  are  the  first 
frequency  shift  measurements  in  an  active,  lasing 
medium  in  which  the  wave  amplitude  exhibits  expo¬ 
nential  growth.  Here,  the  time-varying  dielectric 
medium  is  modeled  as  an  interaction  between  the 
pulsed  relativistic  electron  beam  of  the  free  electron 
laser  (PEL),  the  magnetic  wiggler,  and  the 
copropagating  electromagnetic  wave  launched  into 
the  system.  The  detailed  theoretical  and  computa¬ 
tional  modeling  by  Shvets  and  Wurtele  is  described 
elsewhere.® 

A  schematic  of  the  PEL®  amplifier  is  shown  in  figure 
1a.  A  mildly  relativistic  electron  beam  (750  keV)  is 
generated  by  a  Marx  capacitor  bank  (Physics  Inter¬ 
national  Pulserad  110  A).  The  electrons  are 
emitted  from  a  hemispherical  graphite  cathode  by 
an  explosive  field-emission  process.  The  graphite 
anode  acts  as  an  emittance  selector,  allowing  only 
a  small  fraction  of  the  current  (90  A)  to  propagate 
through  its  2.54-mm  radius  and  62-mm-long  aper¬ 
ture. 

The  50-period  bifilar  helical  wiggler  produced  by 
current-carrying  helical  wires  has  a  period  of  3.18 
cm  and  provides  a  magnetic  field  of  uniform  ampli¬ 
tude  whose  magnitude  on  axis  is  adjustable  up  to 
1.8  kG.  The  wiggler  field  intensity  is  slowly 
increased  over  the  initial  six  periods,  providing  an 
adiabatic  input  for  the  electron  beam.  The  system, 
including  the  gun,  is  immersed  in  a  uniform  axial 
guide  magnetic  field  generated  by  a  solenoid.  The 
intensity  of  this  field  can  be  varied  up  to  a 
maximum  of  1 1 .6  kG. 

The  2-m-long  stainless-steel  drift  tube  has  an 
internal  radius  of  0.51  cm  and  acts  as  a  cylindrical 
waveguide  whose  fundamental  TEi,  mode  has  a 
cutoff  frequency  of  17.2  GHz.  The  system  is 
designed  to  operate  in  this  lowest  waveguide  mode. 
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(a) 


(b) 


Figure  1.  (a)  Free  electron  laser  set-up  (b)  schematic  of 
the  spectrum  analyzer. 

A  high-power  magnetron  operating  at  33.39  GHz  is 
the  input  power  source  for  the  PEL  amplifier.  The 
wave  launcher  consists  of  a  short  section  of  circular 
waveguide  of  radius  0.31  cm  into  which  -12  kW  of 
power  is  coupled  from  a  standard  Ka-band  rectan¬ 
gular  waveguide.  This  section  of  circular  wave¬ 
guide  supports  only  the  fundamental  TEn  mode  for 
the  operating  frequency.  Its  radius  is  then 
adiabatical'y  increased  to  the  radius  of  the  drift 
tube.  A  linearly  polarized  wave  is  thereby  injected 
into  the  interaction  region.  Half  of  the  incident 


6  S.C.  Wilks,  J.M.  Dawson,  and  W.B.  Mori,  Phys.  Rev.  Lett.  61;  337  (1989);  C.J.  Joshi,  C.E.  Clayton,  K.  Marsh,  D.B.  Hopkins,  A. 
Sessler,  and  D.  Whitlum,  IEEE  Trans.  Plasma  Sci.  18:  814  (1990);  V.R.  Goteti  and  D.K.  Kalluri,  IEEE  Trans.  Plasma  Sci.  17:  828 
(1989);  D.K.  Kalluri,  IEEE  Trans.  Antennas  Propag.  37: 1638  (1989). 

7  W.B.  Mori,  Phys.  Rev.  A  44:  5118  (1991);  P.  Sprangle,  E.  Esarey,  and  A.  Ting,  Phys.  Rev.  A  41:  4463  (1990);  H.C.  Kapteyn  and 
M.M.  Murnane,  J.  Opt.  Soc.  Am.  B  8:  1657  (1991);  R.L.  Savage  Jr.,  W.B.  Mori,  C.  Joshi,  T.W.  Johnston,  and  G.  Shvets,  Dept,  of 
Electr.  Eng.,  UCLA,  Report  No.  PPG-1422  (1992). 

8  G.  Shvets  and  J.  Wurtele,  submitted  to  Phys.  Fluids. 

0  M.E.  Conde  and  G.  Bekefi,  Phys.  Rev.  Lett.  67:  3082  (1991);  M.E.  Condo  and  G.  Bekefi,  IEEE  Trans.  Plasma  Sci.  20:  240  (1992). 


176  RLE  Progress  Report  Number  135 


Chapter  1 .  Plasma  Dynamics 


power,  with  the  correct  rotation  of  the  electric-field 
vector  of  the  amplified  circularly  polarized  wave, 
participates  in  the  FEL  interaction. 

The  output  power  from  the  FEL  is  transmitted  by 
means  of  a  conical  horn  into  a  reflection  free 
"anechoic  chamber."  A  small  fraction  of  the  radi¬ 
ation  is  then  collected  by  a  receiving  horn,  passes 
through  precision  calibrated  attenuators,  and  a 
1.7-GHz-wide  band-pass  filter.  The  power  level  is 
finally  determined  from  the  response  of  a  calibrated 
crystal  detector. 

The  spatial  growth  of  the  electromagnetic  wave 
intensity  is  determined  from  the  measurement  of 
the  output  power  as  a  function  of  the  length  of  the 
interaction  region.  This  length  is  varied  by 
changing  the  distance  that  the  electron  beam  is 
allowed  to  propagate  in  the  drift  tube.  Application 
of  a  strong  transverse  magnetic  field  locally  is  suffi¬ 
cient  to  deflect  the  electrons  into  the  waveguide 
wall  and  thereby  terminate  the  interaction  at  that 
point.  The  results  of  such  measurements  are  illus¬ 
trated  in  figure  2  which  shows  an  initial  exponential 
growth  of  the  signal  intensity,  followed  by  nonlinear 
saturation  at  a  power  level  of  ~6  MW. 

The  frequency  spectrum  of  the  output  power  is 
determined  by  a  heterodyne  technique.  A  crystal 
rectifier  is  used  as  a  mixer  for  the  33  GHz  FEL  radi¬ 
ation  and  for  radiation  from  a  variable  frequency  23 
GHz  local  oscillator.  The  resulting  beat  wave  is 
amplified  and  sent  to  an  array  of  band-pass  wave¬ 
guide  filters.  Thus,  the  calibrated  response  of  the 
various  channels  provides,  on  a  single  shot  basis,  a 
direct  histogram  of  output  power  in  terms  of  radi¬ 
ation  frequency. 

A  schematic  of  the  experimental  setup  used  to 
measure  the  frequency  spectrum  is  shown  in  figure 
1b.  An  Alfred  650  sweep  oscillator  is  used  as  a 
local  oscillator,  generating  -6  mW  at  23.39  GHz. 
This  signal  is  combined  in  a  magic  tee  with  the  FEL 
signal  (previously  attenuated  to  the  appropriate 
milliwatt  level)  and  then  sent  to  a  crystal  rectifier 
(HP  R422A).  The  nonlinear  response  of  this  crystal 
gives  rise  to  the  beat  wave  of  the  two  signals,  with 
a  frequency  of  10  GHz.  This  beat  wave  is  then 
amplified  by  a  Narda  60164  solid  state  amplifier 
and  by  a  Logimetrics  TWT,  and  finally  sent  to  the 
filter  bank.  The  bank  consists  of  32  waveguide 
band-pass  filters  adjacent  in  frequency  and  each 
one  80  MHz  wide.  These  channels  are  fed  in  par¬ 
allel  and  each  one  has  its  own  rectifier  crystal  (HP 
33330B).  Thus,  the  calibrated  respotise  of  this 
array  of  rectifiers  gives,  on  a  single  shot  basis,  the 
frequency  spectrum  of  the  output  power. 

There  are  two  factors  limiting  the  number  of  chan¬ 
nels  that  can  actually  be  used.  The  first  one  is  the 


Figure  2.  FEL  output  power  as  a  function  of  the  inter¬ 
action  length.  Beam  energy  750  keV;  beam  current  90 
A;  axial  guide  magnetic  field  4.06  kG;  wiggler  magnetic 
field  0.63  kG. 

bandwidth  of  the  TWT  amplifier,  from  9.5  to  10.5 
GHz,  allowing  only  13  channels  to  be  used.  The 
other  limitation  is  the  number  of  available 
oscilloscopes  to  monitor  the  filter  bank  channels. 
This  second  problem  is  solved  by  using  delay  lines 
to  give  temporal  separation  between  channels,  such 
that  four  of  them  can  be  added  in  a  single 
oscilloscope.  The  measured  attenuation  of  each  of 
these  delay  lines  has  to  be  taken  into  account  in 
the  analysis  of  the  data. 

The  frequency  spectrum  in  the  saturation  region, 
corresponding  to  a  measurement  with  a  full  length 
wiggler  of  -180  cm  (see  figure  2)  is  shown  in  figure 
3a.  We  observe  that  the  spectrum  is  shifted  from 
the  input  magnetron  frequency.  Its  full  width  at  half 
maximum  (FWHM)  is  less  than  160  MHz.  The  80 
MHz  width  of  each  band-pass  filter  does  not  allow  a 
resolution  better  than  this,  and,  as  a  matter  of  fact, 
the  short  duration  of  the  radiation  pulse  (-20  ns) 
limits  the  resolution  to  no  better  than  -50  MHz  (dic¬ 
tated  by  Af  At  -  1). 

To  see  if  the  frequency  shift  occurs  at  all  power 
levels,  that  is,  in  the  exponential  growth  regime  and 
in  the  saturated  regime,  we  scan  the  output  power 
as  a  function  of  interaction  length  and  observe  any 
frequency  shifts.  Figure  3b  shows  a  scan  of  power 
versus  interaction  length  (using  the  kicker  magnet) 
from  two  adjacent  channels  of  the  filter  bank, 
namely,  the  channel  corresponding  to  the  driver  fre¬ 
quency  and  its  higher  frequency  neighbor.  The 
driver  channel  initially  has  the  higher  power,  but  it  is 
surpassed  by  the  other  channel  for  longer  inter¬ 
action  distances.  From  these  measurements  and 
measurements  of  the  total  power  integrated  over 
the  1.7  GHz  frequency  band  (figure  2),  we  conclude 
that  for  short  interaction  lengths,  that  is,  in  the 
exponential  growth  regime,  the  RF  power  is  pre¬ 
dominantly  at  the  channel  centered  around  the 
input  magnetron  frequency  of  33.39  GHz. 
However,  when  one  reaches  the  nonlinear  satu- 
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power  for  five  successive  shots,  for  an  interaction  lenc'h 
of  180  cm;  the  dotted  vertical  line  shows  the  input  mag¬ 
netron  frequency,  (b)  RF  power  as  a  function  of  inter¬ 
action  length  for  two  adjacent  channels  of  the  filter  bank, 
(c)  FEL  frequency  as  a  function  of  magnetron  input  fre¬ 
quency  for  an  interaction  length  of  180  cm. 

ration  regime,  the  power  is  predominantly  in  the  fre¬ 
quency  upshifted  channel  centered  around  33.47 
GHz. 

When  the  input  magnetron  frequency  is  varied,  the 
frequency  spectrum  moves  in  proportion  to  the 
input  frequency,  but  it  is  always  shifted  towards 
higher  frequencies,  as  is  shown  in  figure  3c. 


In  summary,  we  have  observed  frequency  shifts  in 
a  pulsed  free  electron  laser  amplifier.  One  possible 
cause  could  be  the  presence  of  nonlinear  excitation 
of  side-band'®  instabilities  which  are  often  observed 
in  the  saturated  regime  of  FELs.  Since  we  observe 
frequency  upshifting  in  both  the  exponential  growth 
and  saturation  regimes  (figure  3b)  we  rule  this  out 
as  a  cause  of  our  observations.  Another  cause 
could  be  the  presence  of  self-amplification  of  spon¬ 
taneous  emission  (SASE)  which  occurs  even  in  the 
absence  of  the  magnetron  driver.  We  rule  out  this 
effect  for  two  reasons.  First,  our  measurements" 
of  SASE  have  shown  that  the  intensity  of  the  emis¬ 
sion  is  about  an  order  of  magnitude  lower  than  the 
intensity  of  the  amplified  signal  (figure  2).  Sec¬ 
ondly,  when  we  vaiy  the  magnetron  frequency,  the 
FEL  emission  frequency  tracks  the  input  magnetron 
frequency  as  illustrated  in  figure  3c. 

Our  observations  are  in  qualitative  agreement  with 
computations  modeled  on  the  premise  that  the  fre¬ 
quency  shifting  is  the  result  of  the  rapid  temporal 
variations,  typically  -10  ns,  of  the  voltage  and 
current  pulses  of  our  FEL.  This  leads  to  temporal 
variations  of  the  effective  dielectric  coefficient  asso¬ 
ciated  with  the  FEL  interaction.  The  contributions 
to  Af  are  shown'^  to  consist  of  two  components. 
One  can  be  attributed  to  "dielectric  beam  loading", 
that  is,  the  temporal  behavior  of  the  right  circularly 
polarized  wave  propagating  through  the  rela'ivistic 
electron  beam.  This  is  present  even  in  the  absence 
of  the  magnetic  wiggler  and  leads  to  frequency 
downshifting.  The  other  component  is  associated 
with  the  temporal  variations  of  the  slow  ponderomo- 
tive  wave  excited  by  the  presence  of  the  magnetic 
wiggler,  and  yields  a  frequency  upshift.  The  net  of 
the  two  effects  is  a  frequency  upshift  of  -1 00  MHz. 

We  note  that  the  magnitude  of  the  frequency 
upshifting  or  downshifting  depends  sensitively’^  on 
the  relative  temporal  voltage  and  current  histories  of 
the  relativistic  electron  beam.  These  may  differ  for 
different  regimes  of  FEL  operation.  The  aforemen¬ 
tioned  experiments  were  carried  out  in  an  FEL 
operating  in  the  Group  I  regime.  Frequency  shifting 
was  not  seen  in  the  Group  II  or  in  the  Reversed 
Field  regimes'®  which  occur  at  much  higher  guide 
magnetic  fields.  To  fully  understand  the  differences 
in  the  observed  frequency  characteristics  requires 
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Methods  A  259-.  104(1987). 
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monitoring  the  temporal  beam  voltage  and  current 
characteristics  to  a  precision  of  the  order  of  1  -2  ns, 
a  task  that  has  not  been  carried  out  to  date. 

Our  frequency  shift  measurements  were  carried  out 
in  a  Raman  FEL  amplifier.  However,  frequency 
shifts  may  also  be  important’^  in  FEL  oscillators. 
They  could  also  be  of  practical  interest  in  frequency 
chirping  using  carefully  shaped  voltage  and  current 
pulses  of  the  relativistic  electron  beam. 


1.1.2  Frequency  Shifting  Phenomena  In 
Free  Electron  Lasers  -  Theory 

The  frequency  shifting  of  light  by  a  moving  dielectric 
front  and  bulk  ionization  of  gases  has  been  t  len- 
sively  investigated  both  theoretically  and  exper¬ 
imentally.  These  studies  have  been  restricted  to 
passive  dielectric  systems — those  which  induce 
phase  shifts  but  have  negligible  gain.  An  electro¬ 
magnetic  wave  in  an  underdense  plasma,  with 

(,j2 

dielectric  e  =  1 - ^  , 


where  the  source  term  has  two  distinct  contrib¬ 
utions:  J„„  from  the  usual  (nonresonant)  cold  beam 
response  to  the  EM  wave  which  is  present  in  the 
absence  of  the  wiggler,  and  a  second  term  from  the 
resonant  FEL  interaction.  The  resonant  current  Jfei 
is  produced  by  the  ponderomotive  force  of  the  EM 
wave  beating  with  the  magnetic  field  of  the 
wiggler.’s  The  beam  is  bunched  in  the  ponderomo¬ 
tive  wave,  and  the  wigglinv^  uunches  produce  the 
current  which  syncronously  orives  the  EM  wave.  In 
the  linear  analysis  (to  which  we  restrict  ourselves  in 
this  paper),  this  '•esults  in  an  exponentially  growing 
instability.  Since  the  parameters  of  the  electron 
beam  can  be  time-dependent,  this  can  potentially 
lead  to  frequency-changing  phenomena.  The  FEL  is 
subject  to  frequency  shifts  when  the  driving  terms 

Jnr  and  Jtei  are  time-dependent. 

It  is  instructive  to  briefly  summarize  frequency 
shifting  in  passive  media,  such  as  occurs  when 
radiation  propagates  through  a  moving  inhomoge¬ 
neous  medium’®  with  no  FEL  interaction.  The  wave 
equation,  when  used  together  with  the  canonical 
momentum  conservation,  becomes: 


being  the  typical  example.  Similar  physics  can  be 
expected  in  an  active  medium  which  exhibits  both 
gain  and  phase  shifts,  such  as  the  FEL.  Since  any 
frequency  shift  creates  a  phase  shift,  these  shifts 
must  be  fully  understocd  if  the  FEL  is  used  as  a 
driver  for  high  gradient  accelerators.  They  will 
place  limits  on  the  tolerable  shot-to-shot  jitter  in 
system  parameters.  The  frequency  shifting  mech¬ 
anism  also  provides  the  first  explanation  of  the  way 
in  which  an  FEL  oscillator  can  adjust  its  frequency 
to  changes  in  beam  energy. 

Two  distinct  parameter  regimes,  a  high-power  high- 
gain  microwave  amplifier  and  a  low-gain  infrared 
oscillator,  have  been  studied  in  detail.  A  brief 
description  of  the  amplifier  analysis  is  given  below; 
a  detailed  study  is  provided  elsewhere.  The  theory 
is  consistent  with  recent  experiments'*  which  report 
frequency  upshifting  in  a  FEL  microwave  amplifier. 


JL_§_ 


dZ  C 


(2) 


where  we  have  introduced  the  normalized  vector 
potential  of  the  radiation  field  a  =  eA/mc^  e  is  the 
electron  charge,  m  is  the  electron  mass  and  c  is  the 
speed  of  light  in  vacuum.  It  is  convenient  to  trans¬ 
form  to  the  set  of  variables 


z  =  z,  (3) 

where  s  is  the  distance  from  the  head  of  the  pulse, 
which  propagates  with  group  velocity  Vg. 


The  wave  equation  for  the  vector  potential  of  the 
transverse  EM  wave  including  the  FEL  and  plasma 
coupling  is 


Using  the  eikonal  approximation,  which  in  our  nota¬ 
tion  requires  that  variations  in  z  are  much  slower 
than  in  s,  results  in: 


c  dr  ^ 


aa(s,z)  -  !Wp(s.zfVg 


dz 


2(1)0“' 


a(s,z). 


(4) 
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While  it  is  straightforward  to  integrate  equation  4  by 
quadratures,  further  insight  can  be  obtained  by 
assuming  that  the  density  gradient  is  moving  with 
constant  velocity,  Vb.  Here,  for  a  prescribed  density 
profile  u)g(t  -  z/Vb), 

a(s,z)  -  a(s,0)  exp(  -  i0(s,z)),  (5) 

and  the  frequency  change,  bw  =  54>/as,  is  now  given 
by 

6a)(s,z)  = 

PbPg 

2u)(1  -  Pb/Pg) 

(6) 

It  is  instructive  to  analyze  this  equation  in  two  limits. 

First,  expanding  the  term  (ug[s-  ~  |  jn 

equation  6  results  in  an  \  “  / 

expression  for  the  frequency  change  of  a  pulse  that 
has  propagated  a  distance  z  with  the  moving 
plasma: 


We  now  restrict  our  analysis  to  the  "zero  slippage" 
approximation,  which  neglects  the  difference 
between  group  velocity  of  radiation  and  the  velocity 
of  an  electron  beam.  This  is  reasonable  for  many 
microwave  FELs  which  have  long  pulse-lengths, 
moderately  relativistic  energies  and  a  reduction  of 
the  group  velocity  in  the  waveguide.  For  instance, 
in  a  recent  studies  of  a  Raman  FEL,’^  the  slippage 
parameter 


4)6301 


is  less  than  5  percent.  The  FEL  equations,  which 
have  temporal  and  spatial  derivatives  in  the  field 
equation,  then  become  ordinary  differential  equa¬ 
tions. 

As  is  well  known,  a  FEL  with  a  helical  wiggler  and 
axial  magnetic  field,  can  be  described,  in  the  coor¬ 
dinates  of  equation  3,  by 

dZ 

iC  a(z,s)  exp(i0)  - 

2P|| 


8a)(s,z)-pg-^Z 


acop 

a(cs) 


(7) 


This  expression  is  well  behaved  for  Vg  =  Vb,  and  our 
subsequent  analysis  of  the  FEL  amplifier  will  be 
carried  out  under  this  "zero  slippage"  assumption. 
Secondly,  the  opposite  limit,  that  of  a  step  profile 
electron  beam,  where  finite  slippage  generates  the 
frequency  shift,  can  also  be  recovered  from 
equation  6.  Radiation  starts  inside  the  electron 
beam  and  eventually  slips  out  of  the  beam, 
because  Vg  >  Vb,  is  frequency  upshifted  by 


.  b>p(s)Pi 

I  — - - exp(ie)  <  exp(  - 10)  > 


(10) 


de(Z.S) 

dz 


+  kgg  -  a)/V| 


da 

dz 
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iF 


<«p(s) 

2YuVb 


u)  -  k^Vp 
(i)  -  kjVp  -  Qq/y 


(11) 


8(0  - 


PbPg 

1  -  Pb/Pg 


(8) 


This  effect  is  important  in  understanding  the  spec¬ 
tral  characteristics  of  FEL  oscillators. 

Unfortunately,  when  the  FEL  interaction  is  included, 
the  differential  equation  for  the  vector-potential  is  no 
longer  first-order,  and  a  simple  integration,  similar 
to  the  one  above,  is  no  longer  possible. 


2niC-l|^<-|texp(-i0)>,  (12) 

'a  P|| 

where  vj-PnC  is  the  longitudinal  electron  velocity, 
0  -  (k,  +  k*)z(ot-  the  slow  varying  ponderomotive 
phase,  Vb  is  the  unperturbed  beam  velocity,  y  is  the 
beam  relativistic  factor,  Qg  -  eBo/mc  is  the  cyclotron 
frequency  due  to  the  axial  field  Bo,  F  is  a 
dimensionless  filling  factor,  C  is  a  geometric  factor 
which  depends  on  the  waveguide  and  mode.  The 
factor  C  is  defined  in  Wurtele,  Chu  and  Fajans 
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(1990)'®  and  roughly  equals  the  inverse  of  the 
waveguide  radius.  A  nonresonant  beam-loading 
term,  which  is  not  present  in  earlier  work,’®  has 
been  included  in  equation  12.  Equations  10-12 
were  linearized’®  with  the  approximation  that  force 
bunching  and  wave  perturbation  of  the  equilibrium 
orbits  is  negligible.  The  dispersion  relation  thereby 
obtained  (including  the  beam-loading  term)  can  be 
written 


National  Laboratories.  We  thank  Paul  Woskov  for 
lending  us  the  frequency  filter  bank. 


1.2  Plasma  Wave  Interactions — RF 
Heating  And  Current  Generation 


1.2.1  Introduction 


r  +  F 


Wn(S) 


w  -  k^Vb 


2Ya)Vb  0)  -  k^Vb  -  Qc/y 


^(rH-AK)2 


1  ^P||0  Plt^p 

P||0  ^ 


nC^P>^Q\(o  ^P|0 

PjfoC'A  ^ 


(13) 


Equation  13  can  be  solved  explicitly  for  the  real  part 
of  r,  the  s  derivative  of  which  is  easily  related  to 
the  frequency  shift.  Frequency  shifts  are  seen  to 
occur  because  of  two  distinct  mechanisms:  the 
usual  cold  beam  dielectric  shift  (from  the  beam 
density)  and  the  coupling  to  the  slow  space-charge 
wave,  which  is  due  to  the  FEL  interaction.  The 
beam  dielectric  shift  is  physically  equivalent  to  that 
of  a  cold  plasma  and  is  extensively  described  in  the 
literature,’®  the  FEL  term,  which  has  time- 
dependence  from  changes  in  beam  voltage  and 
current,  has  not  been  analyzed  until  now.  Numerical 
evalution  of  this  expression  for  the  frequency  shift 
from  physically  realizable  pulse  forms  for  the 
voltage  and  current  is  in  agreement  with  the 
observed  shifts  in  reference  1 2. 

Further  investigations  to  determine  the  effect  of  fre¬ 
quency  shift  phenomena  on  the  performance  of 
FEL  user  facilities  and  power  sources  for  high  gra¬ 
dient  structures  are  ongoing. 

This  work  was  supported  by  the  Air  Force  Office  of 
Scientific  Research,  Department  of  Energy,  National 
Science  Foundation,  and  Lawrence  Livermore 


Sponsors 

Lawrence  Livermore  National  Laboratories 
Subcontract  B-1 60456 

National  Aeronautics  and  Space  Administration 
Grant  NAGW-2048 
National  Science  Foundation 
Grant  ECS-88-22475 
U.S.  Department  of  Energy 
Grant  DE-FG02-91-ER-54109 

Project  Staff 

Professor  Abraham  Bers,  Dr.  Abhay  K.  Ram,  Dr. 
Carson  C.  Chow,  Dr.  Vladimir  Fuchs,  Michael  C. 
Moldoveanu,  Steven  D.  Schultz,  Luigi  Vacca 

The  work  of  this  research  group  is  concerned  with 
studies  on  the  electrodynamics  of  plasmas.  Atten¬ 
tion  is  directed  toward  understanding  the  nonlinear 
dynamics  of  plasmas  driven  by  high-frequency  elec¬ 
tromagnetic  fields  (as  in  RF  heating  and  current 
drive  of  magnetically  confined  plasmas,  or  in  laser- 
plasma  interactions)  and  the  generation  and  propa¬ 
gation  of  unstable  radiations  from  laser-plasma 
interactions  and  anisotropic  electron  distributions  in 
space  and  astrophysical  plasmas 

In  the  following  we  report  on  two  studies  of  chaotic 
dynamics:  the  first  in  stimulated  Brillouin  scattering 
with  particular  application  to  optical  fibers  and  the 
second  in  plasma  ion-cyclotron  orbits  driven  by 
electrostatic  wave  fields.  The  third  report  is  on  our 
progress  in  modeling  and  understanding  of  the 
observed  current  drive  efficiency  enhancement  in 
the  Joint  European  Torus  (JET)  through  the 
synergistic  use  of  ion-cyclotron  range  of  frequency 
(ICRF)  waves  with  lower-hybrid  current  drive 
(LHCD).  The  last  report  describes  our  recent  study 
of  the  double  stream  cyclotron  maser  amplifier. 
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1.2.2  Chaotic  Stimulated  Brillouin  one  dimension  and  time  for  the  fields  of  the  waves 

Scattering  (SBS)  in  an  Optical  Fiber  involved,  including  resonance  detuning  (8). 
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As  part  of  our  studies  on  chaotic  wave  dynamics  in 
spatially  extended  media,  we  have  considered  in 
some  detail  the  SBS  three-wave-interaction  (3WI). 

The  dynamics  of  stimulated  Brillouin  scattering  in  a 
homogeneous  medium  of  finite  extent  has  captured 
considerable  interest  both  in  laser-plasma  inter- 
actions^o  and  optical  fibers.*’  Nonstationary  and 
chaotic  behavior  had  been  previously  reported  in 
SBS  with  external  feedback  such  as  reflection  at 
the  boundaries**  or  with  models  involving  more  than 
one  pump.**  Harrison  et  al.**  have  observed  chaotic 
SBS  experimentally  in  an  optical  fiber  without  feed¬ 
back.  Gaeta  and  Boyd**  have  performed  similar 
experiments  and  obtained  similar  results.  However, 
these  scientist  propose  that  the  experimentally 
observed  aperiodic  behavior  is  due  to  amplification 
of  noise. 

We  have  shown*®  that  with  the  addition  of  reso¬ 
nance  detuning  and  without  feedback,  the  spatio- 
temporal  3W1  modeling  SBS  can  be  chaotic  in  a 
restricted  parameter  regime.  The  model  we 
propose  appears  to  be  one  of  the  simplest  SBS 
models  exhibiting  chaos  in  spatially  extended 
media.  We  utilize  the  nonlinear  3WI  equations**  in 


In  a  typical  experiment  with  fused  silica  optical 
fibers  and  a  single-mode  argon-ion  laser  operating 
at  X=51 4.5  nm,  the  parameters  are:  index  of  refrac¬ 
tion  r?=1.46;  acoustic  velocity  Ca-  5.96  x  10*  ms  ’; 
acoustic  damping  rate  Va  “  270  MHz;  nonlinear  cou¬ 
pling  constant  K-66ms-’V-’;  and  ratio  of  laser  (or 
Stokes)  to  acoustic  damping  rate  yo/Va  “  1 0-  *.'* 
Since  the  damping  length  of  the  acoustic  wave 
(Ca/Va)  isjoiich  smaller  than  the  typical  interaction 
length  ycaC/n/Kao(x-0),  we  can  ignore  the  con¬ 
vective  term  in  the  acoustic  equation.  Also,  the 
damping  on  the  laser  and  Stokes  em  waves  are 
weak  and  can  be  ignored.  Length  and  time  scales 
can  be  rescaled  with  Vat  -*■  t,  XYa(n/c)  -►  x,  A  =  b/y^, 
and  the  wave  amplitudes  can  be  rescaled  with 
Eo  -  aoK/va.  Es  -  asK/va,  Ea  -  (aaK/va)  exp{  -  iAt).  The 
SBS  equations  become 

d,Eo  +  0x^0  -  -  EsEg,  (1) 


0,Es  -  0x^5  =  EoEg,  (2) 

0,Ea  +  (1 +iA)Ea-EoEs,  (3) 

with  the  boundary  conditions  Eo(x«0,t)- 
ao(x  =  0)K/Ya  ■  A,  Es(x  -  L,t)  =  e.  The  scattered  wave 
is  assumed  to  grow  from  a  small  amplitude  (noise) 
E  at  the  right  boundary.  The  laser  wave  Eo  is 
referred  to  as  the  pump,  the  scattered  light  wave  Es 
is  often  called  the  Stokes  wave.  In  terms  of  a 


20  C.J.  Randall  and  J.R.  Albritton.  Phys.  Rev.  Lett.  52:  1887  (1984);  K.  Sauer  and  K.  Baumgarlel.  Phys.  Rev.  Lett.  52.  101  (1984);  R. 
Blaha,  E.W.  Laedke,  A.M.  Rubenchik,  and  K.H.  Spatschok,  Europhys.  Lett.  7:  237  (1988);  S.  Hiiller,  P.  Mulser,  and  A.M.  Rubenchik, 
Phys.  Fluids  B  3:  3339  (1991). 

2'  D.  Cotter,  J.  Opt  Commun.  4:  10  (1983);  I.  Bar-Joseph,  A.A,  Friesem,  E,  Lichtman,  and  R.G.  Waarts,  J.  Opt  Soc.  Am.  B  2:  1606 
(1985);  J.  Coste  and  C.  Montes.  Phys.  Rev.  A  34:  3940  (1986);  J.  Botineau.  C.  Leycuras,  C  Montes,  and  E.  Picholle,  J.  Opt  Soc. 
Am.  B  6:  300  (1989);  R.G.  Harrison.  J.S.  Uppal.  A.  Johnstone,  and  J.V.  Moloney.  Phys.  Rev.  Lett.  65:  167  (1990);  A.L.  Gaeta  and 
R.W.  Boyd.  Phys.  Rev.  A  44:  3205  (1991). 

22  C.J.  Randall  and  J.R.  Albritton  (1984);  K.  Sauer  and  K.  Baumgartel  (1984);  S.  Huller,  et  al.  (1991). 

23  p.  Narum,  A.L.  Gaeta,  M.D.  Skeldon,  and  R.W.  Boyd,  J.  Opt  Soc.  Am.  B  5:  623  (1988);  A.L.  Gaeta,  M.D.  SkekJon,  R.W.  Boyd,  P. 
Narum,  J.  Opt  Soc.  Am.  B6;  1709  (1989). 

24  R.G.  Harrison,  et  al.  (1990). 

25  A.L.  Gaeta  and  R.W.  Boyd  (1991). 

26  C.C.  Chow,  Spatiotemporal  Chaos  in  the  Nonlinear  Three  Wave  Interaction,  Ph.D.  diss..  Dept,  of  Physics,  MIT,  1991;  C.C.  Chow  and 
A.  Bers,  'Chaotic  Stimulated  Brillouin  Scattering  in  a  Finite  Length  Medium,'  Phys.  Rev.  A,  forthcoming  1993. 

27  A.  Bers,  "Linear  Waves  and  Instabilities,"  in  Plasma  Physics— Les  Houches  1972,  eds.  C.  DeWitt  and  J.  Peyraud  (New  York:  Gordon 
and  Broach  Publishers,  1975),  pp.  113-215. 

28  D.  Cotier  (1983);  R.G.  Harrison,  et  al.  (1990);  A.L.  Gaeta  and  R.  Boyd  (1991);  D.  Cotter,  Opt  Quantum  Electron.  19:  1  (1987). 
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typical  optical  fiber  experiment,  L=^  corresponds  to 
0.75  m,  A=1  corresponds  to  270  MHz,  and  iA=E=1 
corresponds  to  4  MVm  ’. 

Equations  (1)  to  (3)  were  numerically  simulated.^ 
The  system  has  four  free  parameters  A,  A,  e,  and  L 
However,  a  numerical  survey  of  the  parameter 
space  indicated  that  a  two-dimensional  surface 
could  capture  the  unfolding  behavior.  The  A-^ 
parameter  plane  for  fixed  L  and  e  was  chosen. 
Figure  4  shows  the  numerically  determined 
unfolding  diagram  in  the  &,-A  plane  for  Z.=40  and 


0  12  3 


A 


Figure  4.  Bifurcation  diagram  in  the  A-A  plane  for  L  =  40 
and  E=  0.0025.  There  are  four  different  phases:  F 
denotes  the  fixed  state,  P  denotes  the  periodic  orbit,  Q 
denotes  quasi-periodicity  and  C  denotes  chaos.  The  line 
of  x's  indicate  a  region  of  phase  coexistence  between  a 
periodic  and  a  quasi-periodic  orbit. 


£=0.0025.  Parameters  L  and  e  were  chosen  so  that 
the  bifurcation  diagram  in  the  A-A  plane  contained 
all  the  observed  dynamics.  For  small  A  and  A  there 
is  a  stable  fixed  state.  It  becomes  unstable  through 
a  Hopf  bifurcation  to  a  periodic  state.  Then  there  is 
a  transition  to  quasi-periodicity  and  to  chaos.  Each 
region  is  discussed  in  detail  in  Chow's  thesis  and  in 
a  forthcoming  publication.^o  There  it  is  shown  that 
the  system  described  by  equations  (1)  to  (3)  has 
one  fixed  state  (figure  5)  which  is  stable  when  there 
is  no  resonance  detuning,  A  =  0.  For  A  x  0,  there  is 
a  phase  twist  over  the  distance,  in  both  the  pump 
(laser)  and  Stokes  waves,  which  depends  on  both  A 
and  A.  When  the  phase  twist  exceeds  a  certain 
threshold,  the  fixed  state  becomes  unstable  and 
Hopf-bifurcates  to  a  periodic  state  (figure  6).  For 
the  chaotic  regime  (figure  7)  to  occur,  the  reflectivity 
must  be  high  and  the  medium  must  be  larger  than 
the  growth  length  of  the  Stokes  wave.  Note  that 
the  coherence  length  of  the  pump  structures  is  on 
the  order  of  the  growth  length  of  the  Stokes  wave. 
Thus  the  resulting  chaos  is  low-dimensional  and 
does  not  exhibit  spatiotemporal  chaos.^’ 


Figure  5.  Fixed  state  spatial  profiles  of  the  pump  (solid 
line),  Stokes  wave  (dashed  line)  and  acoustic  wave 
(dotted  line)  for  the  parameters  A  =  0,  A  =  1 .6,  L  =  20,  e 
=  0.0025. 


29  C.C.  Chow,  Ph.D.  diss.,  Department  of  Physics,  MIT,  1991. 

30  C.C.  Chow,  Ph.D.  diss.,  Department  of  Physics,  MIT.  1991;  C.C.  Chow  and  A.  Bars,  Phys.  Rev.  A,  forthcoming  1993. 

3'  C.C.  Chow,  Ph  D.  diss.,  Dept,  of  Physics,  MIT,  1991;  C.C.  Chow,  A.  Bers,  and  A.K.  Ram.  "Spatiotemporal  Chaos  in  the  Nonlinear 
Three  Wave  Interaction,"  Phys.  Rev.  Lett.  60:  3379  (1992). 
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Figure  6.  (a)  Periodic  state:  Spatial  profile  of  the  ampli¬ 
tudes  at  a  fixed  time  at  (A  =  1 ,  A  =  1 ).  (b)  Periodic  state 
Power  spectrum  of  pump  at  (A  =  1 ,  A  =  1 ). 

1.2.3  Chaotic  Webs  in  Ion-orbit  Chaos 
Induced  by  Waves 

Sponsors 


Figure  7.  (a)  Chaotic  state:  Spatial  profile  at  (A  =  1 ,  A  = 
5).  (b)  Chaotic  state:  Power  spectrum  of  the  pump  at  (A 
=  1.A  =  5). 
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The  motion  of  a  charged  particle  in  a  steady  mag¬ 
netic  field  and  acted  upon  by  an  electrostatic  wave, 
propagating  in  a  direction  perpendicular  to  the  mag¬ 
netic  field,  can  be  written  as: 


where  e  is  the  charge  of  a  particle  of  mass  m,  uc  is 
the  cyclotron  frequency  of  the  particle  in  the  con¬ 
stant  magnetic  field.  Eg  is  the  amplitude  of  the  elec¬ 
trostatic  wave  with  wavenumber  k  and  frequency  (u. 
By  using  the  normalized  variables  x-u)t,  q-kx, 
p-kx/u),  and  Q-cog/o),  the  Hamiltonian  corre¬ 
sponding  to  the  above  equation  of  motion  can  be 
written  as: 

2 

H(q,p,T)--|-  +  -^q^  +  ecos(q-T)  (2) 
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where  e  =  eEok/(ma)2)  is  the  normalized  amplitude  of 
the  electric  field.  By  defining  the  action-angle,  (l.ii)), 
coordinates: 


the  above  Hamiltonian  can  be  transformed  to: 


This  Hamiltonian  has  been  examined  in  a  variety  of 
regimes  by  several  authors.  The  case  when  1/Qp< 
integer  (the  non-resonant  case)  has  been  examined 
in  detail  by  Karney.“  There  it  was  found,  using  an 
analysis  similar  to  the  Chirikov  resonance  overlap 
criterion,  that  the  motion  of  the  ions  became  chaotic 
for  amplitudes  e  >  the  chaotic  region  in  p-space 
being  limited  from  below  by  trapping  condition: 
p>1-ye,  and  from  above  by  ps(10nV/Q)''® 
(where  n  is  the  closest  integer  to  1/£2).  From  these 
conditions,  it  is  clear  that  the  threshold  amplitude 
goes  down  as  the  frequency  of  the  wave  becomes 
much  higher  than  the  cyclotron  frequency  and,  cor¬ 
respondingly,  the  extent  of  the  region  in  p-space 
becomes  wider  as  the  amplitude  is  increased. 

For  1/Q  =  n,  an  integer  (i.e.,  the  wave  frequency  is 
an  integer  multiple  of  the  particle  cyclotron  fre¬ 
quency:  this  will  be  referred  to  as  the  resonant 
case),  the  motion  of  the  charged  particle  was  first 
examined  by  Fukuyama  et  al.^^  and  later  by 
Chernikov,  et  al.  and  Zaslavsky.^**  The  former  found 
conditions  for  attaining  wide-spread  chaos,  when 
p^n,  using  the  Chirikov  resonance  overlap  criterion, 
while  the  latter  showed  the  existence  of  chaotic 
"webs."  The  importance  of  the  web  is  twofold:  first, 
the  chaotic  web  has  no  amplitude  threshold  condi¬ 
tion,  although  the  chaotic  region  is  very  thin  at  low 


amplitudes;  second,  the  chaotic  web  exists  over  all 
phase  space  extending  down  to  zero  energies. 
These  characteristics  could  obviously  have  impor¬ 
tant  new  applications  for  such  wave  induced  chaos. 
The  crucial  problem  is  to  determine  the  parameter 
regime  for  which  the  web  is  sufficiently  thick  to 
have  a  significant  part  of  phase  space  chaotic,  and, 
of  course,  the  relationship  between,  and  transition 
from,  web  chaos  to  the  Chirikov-type  global,  but 
limited  range,  chaos  which  we  already  know  for  this 
problem.  Below,  we  present  some  sample  calcu¬ 
lations  illustrating  the  intriguing  aspects  of  web 
chaos. 

In  the  resonant  case  all  the  particle  orbits  are  con¬ 
fined  to  islands.  For  small  amplitudes,  e,  the 
islandsJie  between  the  zeros  of  the  Bessel  function 
Jn(Y2ln  ),  where  1/Q  -  n.  The  separatrices  of  these 
islands  are  connected  in  a  weo  structure,  and  a 
particle  started  on  this  web  will  diffuse  along  the 
web  (figure  8).  As  e  is  increased,  the  web  widens 
occupying  a  larger  fraction  of  the  phase  space. 
This  increases  the  region  of  phase  space  for  which 
the  motion  of  the  particle  is  chaotic  (figure  9). 
Since  the  web  is  connected  over  the  entire  range  of 
/,  a  particle  starting  on  the  web  can,  in  principle, 
diffuse  over  the  whole  range  of  /.  Note,  however, 
that  the  lower  energy  region  tends  toward  global 
chaos,  while  the  web  extension  to  higher  energies 
remains  rather  thin. 

In  the  slightly  nonresonant  case,  and  for  small  e, 
there  appear  untrapped  particle  orbits  which  sepa¬ 
rate  (in  /)  the  islands  mentioned  above  (figure  10). 
Even  though  the  islands  occupy  most  of  the  phase 
space,  the  web  structure  does  not  connect  all  the 
islands  as  in  the  resonant  case.  However,  there 
exists  a  periodic  web  structure  which  is  separated 
in  /  by  untrapped  particle  orbits.  The  particle  orbits 
can  diffuse  along  this  web  structure  bounded  in  /. 
Increasing  e  does  not  restore  the  web  structure  of 
the  resonant  case.  However,  it  does  lead  to  local 
regions  of  chaotic  motion  which  are  formed  by  the 
widening  of  the  webs  (figures  11a  and  11b). 

As  the  wave  frequency  and  the  particle  cyclotron 
frequency  get  farther  off  resonance,  for  small  e  a 
larger  region  of  phase  space  is  occupied  by 
untrapped  particle  orbits  (figure  11a),  until  eventu¬ 
ally.  when  the  frequencies  are  sufficiently  non¬ 
resonant,  the  entire  phase  space  is  occupied  by 


32  C.F.F.  Karney  and  A.  Bars,  Phys.  Rev.  Lett.  39:  550  (1977);  C.F.F.  Karney.  Phys.  Fluids  2^  :  1584  (1978). 

33  A.  Fukuyama,  H.  Momota,  R.  Itatini,  and  T.  Takizuka,  Phys.  Rev.  Lett.  38:  701  (1977). 

34  A.A.  Chernikov,  M.  Ya.  Natenzon,  B.A.  Petrovichev,  R.Z.  Sagdeev,  and  G.M.  Zaslavsky,  Phys.  Lett.  A  129:  377  (1988):  G.M. 
Zaslavsky,  Chaos  1: 1  (1991). 
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Figure  8.  Action  I  versus  angle  ip  for  Q  -  1/5((u-5(i)a) 
and  E  -  10-L 


Figure  9.  Action  I  versus  angle  ti>  for  Q>  1/4(a)-4u>o) 
and  £  - 10  ’. 


untrapped  particle  orbits.  In  this  case,  as  e  is 
increased,  there  again  appear  islands  in  phase 
space  where  the  particle  orbits  are  trapped. 

We  are  presently  examining  these  puzzling  features 
of  web  chaos,  a  new  type  of  chaos,  and  its  relation 
to  the  more  commonly  encountered  Chirikov  reso¬ 
nance  overlap  chaos. 


1.2.4  Lower  Hybrid  Current  Drive  in  the 
Presence  of  Waves  in  the  ion  Cyclotron 
Range  of  Frequencies 

Sponsor 

U.S.  Department  of  Energy 

Contract  DE-FG02-91-ER-54109 

In  a  set  of  recent  experiments  on  the  Joint 
European  Tokamak  (JET),^®  it  was  obsen/ed  that 
the  lower  hybrid  (LH)  current  drive  (CD)  efficiency 
was  significantly  enhanced  when  the  current  driving 
lower  hybrid  waves  (LHW)  were  launched  in  the 
presence  of  waves  in  the  ion  cyclotron  range  of  fre¬ 
quencies  (ICRF).  The  experiments  were  performed 
in  a  deuterium  plasma  with  a  hydrogen  minority;  an 
asymmetric  (current  drive  mode)  LHW  k|-spectrum 
and  a  symmetric  (heating  mode)  ICRF  k|-spectrum 


were  applied  externally.  The  experimental  condi¬ 
tions  for  the  enhancement  in  CD  efficiency  required 
that  the  ICRF  antenna  be  in  a  monopole  configura¬ 
tion  and  that  the  ion  cyclotron  and  ion-ion  hybrid 
resonance  layers  be  located  near  the  center  of  the 
plasma.  In  fact,  if  any  of  the  these  two  conditions 
were  not  satisfied,  no  enhancement  in  the  CD  effi¬ 
ciency  was  observed. 

The  initial  ICRF  power  is  carried  from  the  antenna 
into  the  center  of  the  plasma  by  fast  Alfv6n  waves 
(FAW).  At  the  resonance  layers  located  inside  the 
plasma  some  of  this  power  is  mode  converted  to 
ion-Bernstein  waves  (IBW).  These  IBWs  then  prop¬ 
agate  away  from  the  plasma  core.  The  mode  con¬ 
version  is  most  efficient  for  small  kj's  (k,  is  the 
component  of  the  total  wave  vector  along  the 
toroidal  magnetic  field). 

The  FAWs  can  directly  and  indirectly  heat  the  elec¬ 
trons.  The  direct  heating  is  through  electron  Landau 
damping  and  transit  time  magnetic  pumping.  The 
indirect  heating  is  by  the  slowing  down  of  the  ener¬ 
getic  minority  ion  tails,  generate  by  FAWs,  on  the 
electron  distribution.  This  heating  of  electrons  is 
known  to  lead  to  an  enhancement  in  the  LHCD  effi¬ 
ciency  and  was  first  obsenred  on  the  Japanese 
tokamak  JT-60  (and,  hence,  known  as  the  JT-60 


35  C.  Gormezano,  M.  Brusanti.  A.  Ekedahl,  P.  Froissard,  J.  Jaquinol,  and  F.  Rimini,  Proceedings  of  the  IAEA  Technical  Meeting  on  Fast 
Wave  Current  in  Reactor  Scale  Tokamaks  (Synergy  and  Complenwtiarity  with  LHCD  and  ECRH),  Aries,  France.  September  23-25, 
1991,  eds.  0.  Moreau.  A.  B4coulet,  and  Y.  Peysson,  pp.  244-259. 


186  RLE  Progress  Report  Number  135 


Chapter  1.  Plasma  Dynamics 


For  the  IBWs,  the  initial  mode  converted  spectrum 
has  parallel  (to  the  toroidal  magnetic  field)  phase 
velocities  which  are  much  larger  than  the  parallel 
electron  thermal  velocity.  Thus,  the  IBWs  cannot, 
initially,  interact  with  the  electron  distribution  func¬ 
tion  unless  it  has  a  runaway  tail.  The  IBW  spec¬ 
trum  also  cannot  interact  with  the  ion  distribution 
function  as  the  mode  conversion  point  is  sufficiently 
far  away  from  the  ion  cyclotron  resonance. 
However,  as  the  IBWs  propagate  away  from  the 
mode  conversion  region,  toroidal  effects  upshift  the 
Ik'iJ's  so  that  eventually  the  IBW  spectrum  can 
interact  with  the  LHCD  electron  distribution 
function.3^  The  interaction  is  in  the  form  of  Landau 
damping.  Thus,  the  IBWs  can  interact  with  the 
energetic  electron  tail  pulled  out  by  the  LHWs. 

Although  several  models  have  been  proposed,  the 
JT-60  effect  remains  largely  unexplained.  Since 
JET  observed  enhancements  in  the  CD  efficiency 
beyond  those  due  to  the  JT-60  effect,  we  have  con¬ 
centrated  our  efforts  toward  understanding  the 
direct  interaction  of  the  FAWs  and  IBWs  with  the 


Figure  11.  Action  I  versus  angle  for 
Q  -  1/5.05(u)  -  5.05  (Do):  (a)  e  -  10-';  (b)  e  -  3  x  10-^ 

LH  generated  asymmetric  electron  distribution  func¬ 
tion.  Toward  this  end,  we  have  studied  solutions  of 
the  relativistic  Fokker-Planck  equation  describing 
the  evolution  of  the  electron  distribution  function,  on 
a  given  flux  surface,  when  acted  upon  by  LHWs 


36  T.  Imai  and  JT-60  Team,  IAEA,  13th  International  Conference  on  Plasma  Physics  &  Contributions  to  Nuclear  Fusion  Research, 
Washington,  D.C.,  October  1-6,  1990,  paper  CN -53/E- 1 -3. 

37  A.K.  Ram  and  A.  Bars,  "Propagation  and  Damping  of  Mode  Converted  lon-Bernstein  Waves  in  Toroidal  Plasmas,"  Phys.  Fluids  B  3: 
1059  (1991). 
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and  either  FAWs  or  IBWs.^®  The  evolution  of  the 
flux  surface  averaged  electron  distribution  function, 
in  a  uniform  magnetic  field  and  for  wave  frequen¬ 
cies  u)  below  the  electron  cyclotron  frequency  Wcs. 
is  given  by: 


diffusion  coefficients  and  pi,  is  the  component  of  the 
electron  momentum  along  the  total  magnetic  field. 
Following  the  usual  procedures,^  we  express  the 
quasilinear  diffusion  coefficients  in  the  following 
forms: 
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where  the  Ds  are  the  appropriate  quasilinear 
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e  is  the  electron  charge.  Vs  are  the  velocities  (with 
the  component  perpendicular  to  the  magnetic  field 
having  the  subscript  i),  Ek’s  are  the  appropriate 
components  of  the  Fourier  transformed  electric  field 
spectrum,  and  the  argument  of  the  Bessel  func¬ 
tions,  Jo  and  Ji,  is  (kiVjL)/a)c«.  The  purpose  of 
expressing  the  diffusion  coefficients  in  different 
forms  for  the  LHWs  and  the  FAW/IBWs  is  that  the 
factor  multiplying  Do  in  equation  (1)  is  essentially 
determined  by  the  input  power  spectrum.  Mean¬ 
while,  Do  in  equation  (2)  depends  only  on  the  local 
plasma  properties  and  polarizations  of  the  fields. 
For  typical  JET-type  parameters,  we  find  that 
where  p,  is  the  electron  Larmor 


radius.  Then,  from  equation  (1),  one  can  see  that 
the  diffusion  coefficient  for  the  FAW  will  be  small 
compared  to  the  LHW  diffusion  coefficient  for 
similar  powers.  Then,  on  the  basis  of  this  compar¬ 
ison  between  the  FAW  and  LHW  diffusion  coeffi¬ 
cients,  one  expects  that  the  FAW  should  not  have  a 
significant  effect  on  the  LH  generated  electron  tails. 
Indeed,  numerical  solutions  of  the  Fokker-Planck 
equation  confirm  this  fact. 

Meanwhile,  since  there  is  no  easy  way  to  evaluate 
the  electric  field  polarizations  for  the  IBW,  a  similar 
type  of  analytical  analysis  comparing  the  IBW  dif¬ 
fusion  coefficient  with  the  LHW  diffusion  coefficient 
is  not  practical.  However,  numerical  evaluations 


38  A.  Bers  and  A.K.  Ram,  'Lower  Hybrid  and  Fast  Wave  Current  Drive — Status  of  Theory,’  Proceedings  of  the  IAEA  Technical  Meeting 
on  Fast  Wave  Current  in  Reactor  Scale  Tokamaks  (Synergy  and  Complementarity  with  LHCD  and  ECRH),  Arles,  France,  September 
23-25,  1991,  eds.  D.  Moreau,  A.  B6coulet,  and  Y.  Peysson,  pp.  2-34;  Ram,  A.K.,  A.  Bers,  V.  Fuchs,  R.W.  Harvey,  and  M.G.  McCoy. 
'Effects  of  Fast  Alfv6n  Waves  in  Lower  Hybrid  Current  Drive,"  Proceedings  of  the  Europhysics  Topical  Conference  on  RF  Heating 
and  Current  Drive  of  Fusion  Devices,  Brussels,  Belgium,  July  7-10,  1992,  pp.  201-204;  A.K.  Ram.  A.  Bers,  V.  Fuchs,  R.W.  Harvey, 
and  M.G.  McCoy,  'Current  Drive  by  Lower  Hybrid  Waves  in  Presence  of  Fast  Alfv4n  Waves,'  Bull.  Am.  Phys.  Soc.  37(6):  1605 
(1992);  A.K,  Ram,  A.  Bers,  V.  Fuchs,  R.W.  Harvey,  and  M.G.  McCoy,  'Effect  of  ICRF  Waves  on  Lower  Hybrid  Current  Drive,'  Pro¬ 
ceedings  of  the  US-Japan  Workshop  on  Non-Inductive  Current  Drive  and  Profile  Control,  Princeton  Plasma  Physics  Laboratory, 
14-16,  1992. 

39  C.F.  Kennel  and  F.  Engelmann,  Phys.  Fluids  9;  2377  (1966);  I.  Lerche.  Phys.  Fluids  1720  (1968). 
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show  that  the  D'®'^  is  small  compared  to  but 
larger  than  Although  the  mode  converted 

power  to  the  IBW  is  smaller  than  the  input  power 
on  the  FAW,  the  electric  field  amplitude  along  the 
IBW  increases  by  almost  an  order  of  magnitude  as 
the  IBWs  propagate  in  a  tokamak  plasma.  This 
increase  is  due  to  the  IBWs  group  velocities 
slowing  down  as  they  propagate  away  from  the 
mode  conversion  region.  Numerical  solutions  of  the 
Fokker-Planck  equation  then  show  that  the  IBWs 
can  significantly  affect  the  electron  distribution  func¬ 
tion  and,  consequently,  lead  to  an  enhancement  in 
the  current  drive  efficiency. 

Another  important  aspect  of  the  IBWs  is  that  they 
interact  with  electrons  on  the  same  flux  surfaces  on 
which  the  LHWs  generate  currents.  It  is  well  known 
that  the  LHCD  generally  occurs  well  off-axis  in  a 
tokamak.  In  experiments  the  ion  resonances  for 
ICRF  waves  are  located  near  axis.  So  the  FAW, 
which  would  interact  with  electrons  near  the  ion  res¬ 
onances,  is  not  interacting  with  the  electrons  on  the 
flux  surfaces  where  LHCD  is  generated.  However, 
IBWs  propagate  away  from  the  mode  conversion 
region  until  there  is  sufficient  enhancement  in  IkjiJ’s 
for  the  IBWs  to  interact  with  the  electron  distribution 
function.  This  interaction  generally  takes  place  well 
off-axis  in  the  region  where  LHWs  produce  electron 
tails.  This  analysis  is  in  general  agreement  with  the 
experimental  conditions  required  for  an  enhance¬ 
ment  in  the  current  drive  efficiency. 


1.2.5  Linear  Analysis  of  the  Double  Stream 
Cyclotron  Maser  Instability 

Sponsors 

National  Aeronautics  and  Space  Administration 
Grant  NAGW-2048 
National  Science  Foundation 
Grant  ECS-88-22475 

The  double  stream  cyclotron  maser  in  which  two 
copropagating  electron  beams  with  different  beam 
velocities  gyrate  in  a  uniform  axial  magnetic  field 
has  been  proposed  as  a  source  of  millimeter  wave¬ 
length  radiation.^®  The  interaction  which  leads  to  the 


radiation  is  between  the  slow  cyclotron  space 
charge  of  one  beam  and  the  fast  cyclotron  space 
charge  of  the  other  beam.  This  interaction  leads  to 
high  frequency  bunching  of  the  electrons  and  sub¬ 
sequent  radiation. 

We  have  carried  out  a  linear  instability  analysis  of 
the  linear,  uniform  medium  dispersion  relation 
describing  the  interaction  of  two  such  beams.^’  The 
dispersion  relation  describes  the  propagation  of 
electrostatic  waves,  and,  thus,  is  strictly  valid  for 
waves  with  phase  velocities  smaller  than  the  speed 
of  light.  The  analysis  shows  that  electrostatic  insta¬ 
bilities  can  be  generated  at  frequencies  lying  near 
the  doppler  shifted  electron  cyclotron  frequencies 
and  also,  approximately,  half  way  between  doppler 
shifted  electron  cyclotron  frequencies.  These  insta¬ 
bilities  exist  for  large  kj  (the  wave  vector  component 
along  the  magnetic  field)  with  tor/ki  <  c  (lo,  is  the  real 
part  of  the  frequency  and  c  is  the  speed  of  light). 
For  low  beam  densities,  typical  of  those  under  con¬ 
sideration,  the  calculations  point  out  the  distinct 
advantage  of  the  two  electron  beam  system.  For  a 
single  beam  of  electrons,  the  slow  cyclotron  space 
charge  wave  near  a  harmonic  of  the  electron 
cyclotron  frequency  does  not  interact  with  the  fast 
space  charge  wave  at  a  neighboring  harmonic.  The 
only  instabilities  in  this  case  are  near  the  electron 
cyclotron  frequency  and  its  harmonics,  and  for 
small  kji  such  that  o),/k.  >  c;  this  is  the  domain  where 
the  electrostatic  approximation  used  to  obtain  the 
dispersion  relation  breaks  down.  Thus,  the  two 
beam  system  is  important  for  generating  emission 
at  very  high  frequencies.  In  figure  12  we  plot 
(o),  -  V||kj|)/Q  (where  Vj  is  the  average  of  the  parallel 
velocities  of  the  two  beams  anc  Q  is  the  average  of 
the  electron  cyclotron  frequency  of  the  two  beams) 
versus  ck^/Q.  The  regions  instability  are  easily  iden¬ 
tifiable. 

We  have  extended  our  linear  analysis  to  include 
parallel  temperature  spreads  in  the  beams.  It  has 
been  found  that  the  growth  rates  of  the  instabilities 
decreases  as  the  temperature  goes  up.  However, 
for  parameters  relevant  to  experiments,  the  growth 
rate  is  not  substantially  reduced  so  that  instabilities 
exist  for  high  frequencies  and  short  parallel  wave¬ 
lengths. 
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1.3  Physics  of  Thermonuclear  Plasmas 
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As  our  primary  activity  in  this  research  program,  we 
study  the  theory  of  magnetically  confined  plasmas 
in  regimes  relevant  to  present-day  advanced  exper¬ 
iments  and  to  future  thermonuclear  devices.  In  the 
future,  these  devices  will  probably  ignite  the  plasma 
("thermonuclear  fuel")  within  toroidal  magnetic  con¬ 
finement  configurations  and  will  involve  either  first 
generation  fuels,  a  deuterium-tritium  mixture 
(Ignitor,  ITER),  or  more  advanced  fuels  such  as 
deuterium-deuterium  or  deuterium-helium  mixtures 
(Candor). 

The  Ignifor-Ult  machine  is  now  in  the  early  stages 
of  construction  in  Europe  At  MIT,  the  Alcator 
C-MOD  experiment  has  recently  begun  operation. 
Alcator  C-MOD  combines  the  favorable  features  of 
an  elongated  plasma  cross  section  with  a  high 
magnetic  field  to  produce  high  plasma  currents. 
This  machine  is  in  fact  very  s.milar  to  Megator,  an 
experiment  we  proposed  in  the  early  1970s  as  a 
logical  continuation  of  the  Alcator  program. 

Currently,  our  research  program  follows  two  major 
avenues.  First,  we  are  studying  the  basic  physical 
processes  of  thermonuclear  plasmas  (equilibrium, 
stability,  transport,  etc.)  as  they  apply  to  existing  or 
near-term  future  systems.  In  this  effort,  we  closely 
collaborate  with  our  experimental  colleagues  and 
with  theorists  from  other  research  groups  (e.g., 
Columbia  University,  JET,  Phillips  Laboratory, 
Princeton  University,  University  of  Texas,  Lawrence 
Livermore  National  Laboratories).  This  work  also 
involves  time-dependent  simulations  of  plasma  dis¬ 
charges  in  the  Ignitor-Ult  experiment.  We  focus 
particular  attention  on  the  evolution  of  spatial  pro¬ 
files  of  plasma  current  and  temperature.  Collab¬ 
oration  with  our  colleagues  at  the  Italian 
laboratories,  Energia  Nucleare  e  Energie  Alternative 


(E.N.E.A.),  as  well  as  inhouse  code  development  by 
scientists  "on  loan"  from  Italy  and  the  former  Soviet 
Union,  plays  a  major  role  in  this  endeavor. 

Second,  we  explore  advanced  regimes  of 
thermonuclear  burning,  including  those  employing 
low  neutron  yield  fuels  (D-^He,  and  "catalyzed" 
D-D).  We  consider  both  the  design  of  machines 
that  will  contain  thece  very  high  temperature 
plasmas  as  well  as  the  physics  that  govern  their 
behavior.  Below,  we  discuss  some  of  the  salient 
results  of  work  completed  or  presently  being  carried 
out  by  members  of  our  research  group. 


1.3.1  Theory  of  Reconnecting  Modes  in 
Collisionless  Plasmas 

We  have  investigated  plasma  processes  capable  of 
producing  new  magnetic  field  topologies  (so-called 
magnetic  reconnection)  in  collisionless,  high  tem¬ 
perature  regimes. 

In  particular,  we  have  developed  a  theory^  of  m  = 
1  modes  based  on  a  kinetic  description  of  the 
plasma  dynamics  inside  the  reconnection  layer  of 
width  5r  about  the  q(r  =  ri)  =  1  resonant  surface 
where  the  parallel  electric  field  Ej  *  0.  At  high  tem¬ 
peratures,  two  scale  distances  characterize  the 
plasma  dynamics:  the  collisionless  skin  depth  de 
=  c/tupe  a  n-  and  the  ion  sound  Larmor  radius 
ps=  (Te/mi)''VQ6  *  Here  Te.,  are  the  electron 
and  ion  temperatures,  and  Qc.  is  the  ion  gyrofre- 
quency.  Typically,  experiments  operate  in  regimes 
where  p*  >  de.  A  third  scale  distance  of  the  problem 
is  given  by  where  is  the  ideal-MHD  stability 
parameter  (a  functional  of  the  global  equilibrium 
profiles).  We  have  focused  on  the  case  of 
ideal-MHD  marginal  stability  Xh  =  0.  In  this  case, 
6r  -  ps  and  the  parallel  current  density  Jj  is  con¬ 
fined  to  a  sub-layer  of  width  -  d*.  By  numerically 
solving  the  relevant  mode  equation,  we  find  a  mc^ 
growth  rate  y/ioA-  (de/r,)  [’1  +  1 /(2yn) 
(1+T|/Te)''^P5/dB]  (u)A  is  the  shear  Alfven  wave  fre¬ 
quency)  that  can  be  quite  fast,  in  particular  for  the 
parameters  of  the  JET  experiment  when  additional 
heating  is  used  to  enhance  the  plasma  temper¬ 
ature.)  In  fact,  we  have  proposed  that  the  crash 
phase  of  the  "sawtooth  oscillations"  in  these  exper¬ 
iments  may  be  caused  by  the  excitation  of  these 
modes. 

At  high  temperatures,  another  effect  that  needs  to 
be  included  in  the  analysis  is  that  of  the  "drift"  fre- 
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quencies  oj-,  «  T/B  (j  denotes  the  species:  electrons 
or  ions),  related  to  the  local  equilibrium  density  (U) 
and  temperature  (Uj)  gradient  scale  lengths,  that  is 
strongly  stabilizing  for  w-e  >  y.  Now,  since 
co-b/coa  =  a-(1  +  Ln/LT)(ps/de)Wro,  where 

a-  =  (me/mi)'^Ls/Ln  and  Ls  is  the  magnetic  shear 
length,  cu-effects  become  dominant  at  large  ps/dg. 
By  numerically  solving  the  relevant  mode 
equation,^  we  have  verified  that,  for  realistic  values 
of  Ls/Ln,  there  is  only  a  small  interval  in  pjda  in 
which  an  enhancement  of  the  growth  rate  by  high 
temperature  effects  can  be  achieved,  with  a  further 
increase  in  temperature  (i.e.,  in  ps/dg)  leading  to  a 
strong  stabilization  of  the  mode. 


1.3.2  Linear  and  Nonlinear  Kink-Tearing 
Modes 

A  linear,  two  fluid  model  of  the  collisionless  recon¬ 
necting  mode  in  tokamaks  has  also  been  formu¬ 
lated,  leading  to  a  stability  condition,  which  is  in 
qualitative  agreement  with  experimental  obsen/a- 
tions  of  sawtooth  activity  in  contemporary,  high  tem¬ 
perature  experiments.  In  contrast  to  the  predictions 
of  the  ideal  MHD  m  =  1  mode  theory,  the  density 
and  temperature  gradients  n'(r,),  T'a.i(ri)  (associated 
with  the  so-called  (o-effects"'^)  play  a  stabilizing  role 
in  this  condition  while  the  magnetic  shear  q'(ri)  is  a 
destabilizing  factor.  Here,  q(r)  =  rB!;/(RB,))  is  the 
"safety  factor",  Bt  and  B,,  are  the  toroidal  and 
poloidal  magnetic  fields,  r  and  R  are  the  minor  and 
major  plasma  radii,  and  r,  is  the  radius  where 
q(ri)  =  1.  An  approximate  form  of  this  stability  con¬ 
dition  may  be  written  as 


(Te-kTi)ln'lR 

(T’e  +  T’i)n 

4 

(Te  +  Ti)n'  J 

>q'(',)r„  (1) 

in  terms  of  practical  units;  keV,  10'^  cm-^  Tesla, 
are  used  for  Tg,„  n  and  B. 

We  have  also  considered  the  nonlinear  evolution  of 
the  sawtooth  crash  in  tokamaks.  Specifically,  our 


study  addressed  the  early  nonlinear  stage  of  the 
crash,  when  the  plasma  central  core  displacement 
^(t)  is  still  small  with  respect  to  the  radius  r  =  r,  but 
exceeds  the  characteristic  width  of  the  narrow  layer 
in  which  the  magnetic  reconnection  occurs.  First, 
the  helically  symmetric  magnetic  geometry  of  the 
nonlinear  reconnecting  mode  was  determined  by 
solving  Waelbroeck’s  equations.^^ 

Waelbroeck's  equations  are  essentially  approximate 
forms  of  the  MHD  equilibrium  equation  and  the 
global  flux  conservation  condition,  which  are  valid 
when  the  crash  time  is  slow  compared  to  the  rele¬ 
vant  Alfv6n  time  ta=  I/wa,  but  fast  when  compared 
to  the  resistive  diffusion  time.  A  typical  result  is 
plotted  in  figure  13,  which  shows  a  poloidal  cross 
section  of  constant  helical  flux  surfaces  for  a  partic¬ 
ular  central  core  displacement.  Combining  this  sol¬ 
ution  with  the  estimate  of  the  reconnection  layer 
width  appropriatft  to  current  experiments, 
A  -  p,  =  ■^(Tg  +  T,)/mi  /Qa,  exponential  growth  of  the 
mode  was  found  with  a  rate  Y-q'(r,)p,WA.  This 
result  is  also  in  good  agreement'*®  with  experimental 
observations  of  the  "fast"  crash. 


Figure  13.  Magnetic  configuration  during  the  early  non¬ 
linear  stage. 


«  G.  Ara,  B.  Basu,  B.  Coppi,  G.  Laval,  M.N.  Rosenbluth,  and  B.V.  Waddell,  W.  V.  Ann.  Phys.  112:  443  (1978). 
**  F.L.  Waelbroeck,  Phys.  Fluids  B  1:  2372  (1989). 
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1.3.3  Impurity  Effects  on  Ion  Temperature 
Gradient  Microturbulence 

The  role  played  by  impurities  in  influencing  the 
linear  stability  properties  of  ion  temperature  gradient 
(ITG)  modes  has  been  considered  with  the  intent  to 
elucidate  the  role  played  by  this  turbulence  in 
producing  anomalous  transport  at  the  plasma  edge. 
As  is  well-known,  conventional  gyro-Bohm  transport 
models  predict  thermal  diffusivities  that  scale  as 
thereby  becoming  small  in  the  outer  plasma. 
(This  is  contrary  to  observed  trends.) 

The  addition  of  one  or  more  impurity  (ion)  species 
has  one  effect  that  may  be  termed  beneficial  in 
terms  of  transport:  it  leads  to  a  dilution  in  the 
primary  ion  concentration  (n,  =  na  -  Zini,  for  a 
hydrogenic  primary,  where  Z|  is  the  charge  of  the 
impurity  ion)  and,  thus,  a  tendency  to  stabilize  the 
standard  ITG  mode.  Indeed,  calculations^  done  in 
shearless  slab  geometry  for  collisionless  plasmas 
indicate  that  the  critical  value  of  the  ion  temperature 
gradient  parameter,  tii  a  d  In  T/d  In  n  can  increase 
by  factors  of  2-8  when  an  impurity  species  is 
present.  The  effect  is  stronger  for  higher  ionization 
states  and  larger  impurity  mass. 

This  study  was  then  carried  out^^  for  shaped 
plasmas  in  toroidal  geometry,  which  is  subject  to 
the  restriction  that  the  mode  frequency  must  be 
greater  than  the  ion  transit  frequency.  A 
collisionless,  electrostatic,  kinetic  model  was 
employed,  and  a  considerable  trend  toward  stabili¬ 
zation  due  to  the  impurity  was  found  again.  Of 
interest  is  the  fact  that  three  geometrical  factors 
associated  with  the  toroidal  geometry  and  noncir¬ 
cular  cross  section  of  the  plasma  also  contribute  to 
smaller  growth  rates.  The  Shafranov  shift  has  a 
strong  stabilizing  influence  on  the  odd-parity  mode 
and  moderate  stabilizing  influence  on  the  even- 
parity  mode.  Parity  here  refers  to  the  property  of 
the  perturbed  electrostatic  potential  when  acted 
upon  by  a  mirroring  operator  in  the  poloidal  angle, 
0  -  0,  where  0  =  0  corresponds  to  the  outside  of 

the  torus  farthest  from  the  axis  of  symmetry). 


Ellipticity  tends  to  strongly  stabilize  the  even-parity 
mode  and  only  moderately  weaken  the  growth  of 
the  odd-parity  mode.  The  situation  is  reversed  for 
triangularity  (odd-mode  affected  more  strongly), 
which  has  a  weaker  stabilizing  effect  overall  than 
ellipticity. 

While  impurities  tend  to  stabilize  the  collisionless 
plasma,  which  is  typically  found  in  the  center  of 
high  temperature  present-day  experiments,  they 
can  trigger  their  own  brand  of  drift-wave  type 
microinstabiiity  in  collisional  regimes  (e.g.,  at  the 
plasma  edge).  This  type  of  mode  is  driven  unstable 
by  the  temperature  gradient  of  the  primary  ion 
species  (like  the  ITG  mode),  but  requires  the  pres¬ 
ence  of  an  impurity  (preferably  massive  in  compar¬ 
ison  to  the  primary  ions)  and  dissipation  (e.g.,  a 
parallel  thermal  conductivity).  It  propagates  in  the 
electron  diamagnetic  direction  for  peaked  impurity 
density  profiles  (d  In  ni/d  In  ni^  1).  First  discussed 
in  the  context  of  collisionless  plasmas  with  reversed 
density  profiles  (d  In  n/d  In  a  <  0),«  its  properties 
were  elucidated  later  for  collisional  regimes.*^  Our 
recent  analysis^°  has  concentrated  on  properties  in 
sheared  slab  geometries.  This  mode  has  been 
found  to  be  robustly  unstable,  relatively  impen/ious 
to  magnetic  shear,  with  an  extended  linear 
eigenfunction  (more  than  10  ion  Larmor  radii  wide). 
Thus,  it  is  a  good  candidate  for  the  anomalous 
energy  transport  in  the  outer  region  of  the  plasma. 


1.3.4  Emissions  at  the  Harmonics  of  the 
Heiium  Gyro-Frequency 

In  recent  experiments  carried  out  with  D-T  plasmas 
in  JET,®’  emissions  were  detected  with  a  spectrum 
that  showed  distinct  peaks  at  multiples  (I  -  1  -►  8)  of 
the  ‘He  cyclotron  frequency.  An  analysis®^  of  the 
linear  stability  of  collective  modes  indicates  that 
these  emissions  are  likely  to  arise  because  of  the 
presence  of  an  anisotropic  (in  velocity  space)  popu¬ 
lation  of  high-energy  a-particles.  These  particles 
occur  naturally  as  by-products  of  D-T  thermonuclear 
fusion  reactions.  The  relevant  modes  are  shown  to 
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be  spatially  localized  in  a  narrow  layer  centered  on 
a  surface  near  the  edge  of  the  plasma  column. 
The  position  of  this  surface  is  consistent  with  that 
inferred  from  the  value  of  the  magnetic  field  that 
corresponds  via  the  cyclotron  frequency  to  the 
observed  radiation.  The  width  of  the  layer  is  com¬ 
patible  with  the  relatively  narrow  width  of  the  peaks 
in  the  emitted  spectrum.  The  outermost  portion  of 
the  plasma  is  reached  by  a-particles  that  are 
trapped  in  orbits  with  relatively  large  values  of  vjv,. 
Work  is  undenvay®^  to  determine  the  characteristics 
of  the  a-particle  velocity  space  distribution  function 
that  are  "optimal"  for  linear  growth  of  these 
cyclotron  harmonics,  so  that,  in  the  future,  emission 
spectra  in  other  experiments  can  be  used  as  a 
diagnostic  tool.  One  additional  issue  being 
explored^'*  is  the  effect  of  impurity  ions  on  the 
emitted  radiation:  they  may  play  an  important  role 
in  determining  some  of  the  finer  features  in  the 
observed  spectrum. 


1.3.5  Deuterium-Tritium  Ignition 
Experiments  with  High  Magnetic  Fields 

High  magnetic  field  experiments  have  been 
designed  to  investigate  deuterium-tritium  fusion 
ignition  conditions,  on  the  basis  of  known  exper¬ 
imental  and  theoretical  understanding  of  plasma 
behavior.  Our  group  has  long  maintained  a  strong 
effort  in  pioneering  the  practical  use  of  such  config¬ 
urations.  We  have  shown  that  the  most  advanta¬ 
geous  and  least  expensive  designs  incorporate  an 
interlocking  set  of  characteristics;  aspect  ratio,  rela¬ 
tively  small  size  with  significant  vertical  elongation, 
high  toroidal  and  poloidal  magnetic  fields,  large 
plasma  currents,  high  plasma  densities,  good 
plasma  purity,  strong  ohmic  heating,  good  plasma 
and  a-particle  confinement,  and  robustness  against 
ideal  MHD  and  resistive  plasma  instabilities.^^  We 
have  investigated  the  physical  basis  for  these 
plasma  properties,  while  also  constructing  the 
design  and  engineering  solutions  for  attaining  the 
necessary  parameters  in  the  Ignitor  Ult  machine®* 
presently  being  built  in  Italy.  Since  ignition  depends 
upon  many  spatially  and  temporally  varying  pro¬ 
cesses,  many  of  our  studies  are  based  upon  the 
numerical  simulation  of  a  free  boundary  plasma 
from  the  current  ramp  through  ignition,  using  the 


MHD  evolution  and  plasma  flux  surface  transport 
code  (TSC).®« 

The  ignition  of  a  50:50  deuterium-tritium  plasma 
requires  a  minimum  value  of  the  parameter 
OoTe  -  4  X  1 0^°  sec/m®  in  order  to  achieve  ignition 
with  Teo-Tios  15  keV,  where  no  is  the  peak  plasma 
(electron)  density,  Teo  the  peak  temperature,  and  te 
the  energy  replacement  time.  Here  ignition  is 
defined  as  the  point  when  the  plasma  heating  due 
to  fusion  a-particles,  ,  equals  the  plasma  thermal 
losses  Pl.  Relatively  high  values  of  the  plasma 
density,  no-10®’m-®,  then  require  only  moderate 
values  of  te,  whose  magnitude  is  less  easy  to 
predict  with  certainty.  Both  these  values  should  be 
achievable,  based  on  the  favorable  confinement 
properties  of  high  density  plasmas  that  have  been 
demonstrated  by  a  series  of  high  field  experiments, 
the  Alcator  A  and  C  at  MIT  and  the  FT/FTU  devices 
at  Frascati,  Italy.  Experimentally,  the  maximum 
plasma  density  no  that  can  be  supported  correlates 
with  the  ratio  Bt/Ro,  where  Bt  is  the  toroidal  mag¬ 
netic  field  at  the  center  of  the  plasma  column,  at 
major  radius  R  =  Ro-  On  the  basis  of  the  Alcator  C 
machine,  where  no-2  x  10®’  m-®  was  achieved  with 
Bt-12.5  T  and  Ro  =  0.64  m,  and  the  TFTR 
machine  at  Princeton,  where  even  larger  ratios  of 
OoRo/Bt  were  achieved,  a  configuration  with 
Ro-1.3m  and  Bt-13  T  should  be  able  to  sustain 
reliably  densities  of  10®’  m-®. 

A  strong  toroidal  magnetic  field  also  supports  a  high 
poloidal  field  Bp  and  correspondingly  large  plasma 
current  Ip.  A  significant  vertical  elongation,  e.g., 
K  -1.8,  substantially  increases  the  plasma  current 
that  can  be  carried  for  a  given  Bt  and  R.  If  the 
density  correlates  with  the  (volume)  averaged 
toroidal  current  density,  <  J,^>,  then  experimental 
results  suggest  that  somewhat  less  than  1  kA/cm® 
should  offer  considerable  margin  to  attain  the 
desired  peak  density  no-10®’m-®.  High  values  of 
Bp  produce  a  strong  rate  of  ohmic  heating,  while 
large  currents  Ip  tightly  confine  the  fast  a-particles 
produced  by  the  fusion  reactions,  so  that  they 
deposit  their  energy  in  the  center  of  the  plasma. 
The  degradation  of  the  plasma  energy  confinement 
that  is  commonly  obsenred  when  injected 
(nonohmic)  heating  is  applied,  is  reduced  at  higher 
plasma  current.  In  addition,  the  poloidal  plasma 
beta  Pp  can  be  kept  small  at  ignition,  to  improve  the 
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plasma  stability,  and,  in  particular,  to  stabilize  the 
ideal  MHD  modes  with  mode  numbers  m  =  1 ,  n  =  1 
that  are  associated  with  sawtooth  oscillations. 
Large  plasma  density  combined  with  good  ohmic 
heating  allows  ignition  at  low  plasma  temperatures. 
This  reduces  the  fusion  power,  and  therefore  the 
thermal  wall  loads.  The  low  beta  increases  the 
overall  margin  of  plasma  stability. 

The  plasma  purity  has  been  shown  to  improve  with 
increasing  plasma  density,  that  is,  the  effective 
charge  Zgtt  =  2niZ?/ne  decreases  monotonically  with 
ne  in  an  extensive  series  of  experiments  starting 
with  the  Alcator  A.  The  major  effect  of  impurities  is 
to  dilute  the  concentration  of  fusing  nuclei,  while  a 
secondary  effect  is  the  increase  of  power  loss  due 
to  bremsstrahlung  radiation.  If  auxiliary  heating  is 
not  used,  Zew  cannot  exceed  about  1.6  for  D-T 
ignition  in  the  Ignitor,  as  indicated  by  our  analyses. 

Relatively  high  plasma  edge  densities  also  help  to 
confine  impurities  to  the  scrape  off  layer,  where  the 
induced  radiation  helps  to  distribute  the  thermal  wall 
loading  more  uniformly  over  the  plasma  chamber 
surface.  The  low  ignition  temperatures  associated 
with  high  density  further  help  to  keep  the  plasma 
clean  by  reducing  the  thermal  wall  loading  that 
result  in  sputtering. 

Peaked  plasma  density  profiles  should  be  main¬ 
tained  by  external  means  such  as  pellet  injectors  if 
necessary.  Peaked  profiles  maintain  stability  to  ion 
temperature  gradient  modes  that  enhance  the  ion 
thermal  transport.  Since  the  neoclassical  (Ware) 
inward  particle  pinch  is  relatively  strong  in  a  tight 
aspect  ratio,  high  field  configuration  and  an  anoma¬ 
lous  inflow  are  also  present.  Therefore,  pellets  that 
penetrate  partway  into  the  plasma  can  be  success¬ 
fully  used  to  raise  the  plasma  density  and  produce 
peaked  profiles  near  ignition. 

Plasma  configurations  such  as  X-points  that  con¬ 
centrate  the  thermal  (particle)  heat  flux  on  localized 
areas  of  the  vessel  wall  limit  the  amount  of  fusion 
power  that  can  be  handled.  In  addition,  an  even 
more  limited  plasma  current  can  be  sustained. 
However,  X-points  and  detached  plasmas  can  be 
obtained  with  relatively  little  sacrifice  in  the  plasma 
and  magnet  parameters.  By  creating  the  conditions 
known  to  produce  "H-mode"  operation,  X-points  and 
detached  plasmas  may  prove  desirable  in  limiting 
degradation  of  the  plasma  confinement  caused  by 
nonohmic  heating. 

Divertors  represent  a  more  severe  compromise, 
since  they  alter  the  design  of  the  plasma  chamber 


and  the  toroidal  magnet.  The  major  radius  must  be 
increased  to  accommodate  a  reliable  divertor, 
reducing  the  ratio  Bt/R  and  therefore  the  maximum 
plasma  density.  The  magnetic  fields  are  also 
reduced,  lowering  the  plasma  current  and  the  ohmic 
heating,  and  increasing  p.  A  large  injected  heating 
system  becomes  necessary  to  replace  the  ohmic 
heating  and  the  resulting  degradation  of  the  plasma 
confinement  makes  low  temperature  ignition  diffi¬ 
cult.  The  divertor  plates  must  then  handle  large 
thermal  heat  fluxes.  There  is  no  demonstrated 
advantage  to  using  divertors  in  high  density 
plasmas,  while  the  cumulative  disadvantages  make 
ignition  difficult  to  attain.  Their  use  would  remove 
much  of  the  rationale  for  using  a  compact,  high  field 
machine. 


1.3.6  Time-dependent  Ignition 

The  transient  nature  of  ignition  has  important  con¬ 
sequences.  The  initial  current  ramp,  when  Ip,  np,  Bt, 
and  the  plasma  cross  section  are  increased  simul¬ 
taneously,  to  their  maximum  values,  has  important 
effects  on  the  plasma  energy  balance  and  stability 
at  ignition.s^  These  effects  arise  from  the  relatively 
slow  inward  diffusion  of  the  plasma  current,  which 
is  added  at  the  outer  surface  of  the  plasma  by  the 
current  ramp,  compared  to  the  growth  of  the  central 
temperature  due  to  plasma  heating.  The  current 
ramp  generates  an  inhomogeneous  toroidal  electric 
field  in  the  plasma  that  is  peaked  near  the  plasma 
edge  and  allows  large  values  of  ohmic  heating  at 
high  central  temperature.  The  magnetic  safety 
factor  q  can  be  easily  maintained  above  unity  or 
held  to  a  very  small  q  <  1  region  during  the  current 
ramp.  A  more  careful  study,®^  shows  that  the  mag¬ 
netic  safety  factor  can  also  be  kept  small  after  the 
ramp,  at  least  until  the  central  temperature  reaches 
high  values  and  fusion  a-particles  begin  to  appear, 
both  of  which  are  stabilizing  effects  for  m  =  1 
modes.  Furthermore,  small  amounts  of  injected 
heating  (e.g.,  Pinj  <  Poh  /2)  during  the  current  ramp 
can  maintain  a  very  small  size  (or  nonexistence)  for 
the  q  <  1  region  until  well  past  ignition,  if  central 
temperatures  approach  10  keV  by  the  end  of  the 
current  ramp,  through  the  freezing-in  of  the  central 
current  density  at  low  resistivity.  Injected  heating 
also  reduces  the  magnetic  flux  consumption 
required  to  reach  ignition,  particularly  if  ignition 
occurs  during  the  current  ramp.  These  character¬ 
istics  are  illustrated  in  table  1  (reference  case  for 
the  maximum  Ignitor  design  parameters,  peak 
Bt=  13.5  T,  lp=12  MA,  R  =  1.3  m,  a  =  0.48  m. 
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K=  1 .8,  and  n^Q™  X  10  m  I  a1  50»50  ratio  otimic  haatino  op  to  iQnitioOi  Pd  s  2Pohi  to  raduca 
and  Zetf  =  1 .2).  tha  dagraa  of  dagradation.  Sinca  a-particia  haating 

possassas  two  important  charactaristics  of  ohmic 
^  It  is  possibla  to  simultanaously  maintain  heating  that  ara  not  sharad  by  any  prasantly  avail- 

monotonically  incraasing  q  profilas  without  larga  low  3i3|g  form  of  injactad  haating — axisymmatric  daposi- 

shear  ragions  and  with  adga  values  3  <q^a  <  4  (jon  and  generation  in  the  center  of  the  plasma 

during  the  current  ramp,  to  beyond  ignition.  These  column — we  expect  that  the  degradation  should  not 

conditions  should  prevent  instabilities  associated  gs  severe, 
with  internal  plasma  modes  (e.g.,  "locked”  or 

quasistationary  modes)  that  can  be  triggered  during  The  requirement  that  the  edge  q  is  between  3  and 

•  the  current  ramp  and  often  lead  to  serious  dis-  4  (high  plasma  current)  means  that  special  care 

ruptions,  in  which  the  confinement  of  the  plasma  must  be  devoted  to  maintaining  q  >  1  up  to  ignition, 

can  be  lost.  Since  hollow  q  profiles  are  usually  If  only  ohmic  heating  is  contemplated,  the  steadily 

associated  with  the  excitation  of  internal  increasing  size  of  the  q  <  1  region  after  the  end  of 

macroscopic  modes  and  enhanced,  "anomalous"  the  current  ramp  imposes  a  more  severe  limit  on 

current  penetration,  while  ignition  is  aided  by  a  slow  the  time  in  which  the  plasma  can  ignite  and  on  the 

^  current  penetration  that  keeps  qo  >  1  for  as  long  as  required  energy  confinement  level  than  the  energy 

•  possible,  these  precautions  are  not  superfluous.  balance  alone.  This  assumes  that  sawtooth  oscil¬ 

lations  large  enough  to  destroy  the  central  peaking 
A  major  question  for  all  ignition  experiments  is  the  gf  f^g  temperature  cannot  be  avoided.  Then 

degree  of  degradation  expected  in  the  plasma  ignition  in  the  Ignitor  reference  case,  at  To  -  1 1  keV, 

energy  confinement  near  ignition,  since  D-T  ignition  requires  xe  -  0.66  seconds  and  must  occur  within 

is  easily  achieved  if  the  confinement  remains  at  the  approximately  1 .5  seconds  of  the  end  of  the  current 

^  optimal  ohmic  heating  level.  One  strategy  for  a  ramp  (table  1).  Our  theoretical  analysis,  on  the 

high  field  experiment  is  to  maintain  a  high  level  of 


Table  1.  Reference  Discharge  for  the  Ignitor  Ult.  Maximum  parameters  at  the  End  of  Ramp  Ip  =12  MA, 
Bt=  13.5T,  R  =  1.3  m,  a  =  0.48  m,  b-0.87  m,  6-0.4,  n«,=  1.1  x  10“’  m-“,  n,*,/  <n,>  %  =  2.2,  Z,,  =  1.2. 


Ignition 


time  (sec) 


internal  inductance 


poloidal  beta 


toroidal  beta  (%) 


edge  magnetic  safety  factor 


plasma  kinetic  energy  (MJ) 


peak  electron  temperature  (keV) 


energy  confinement  time  (msec) 


ohmic  heating  (MW) 


a-particle  heating  (MW) 


peak  a-particle  density  (10’^m  -3) 


bremsstrahlung  radiation  (MW) 


cyclotron  and  impurity  radiation  (MW) 


volume  where  q  s  1  (%  of  total) 


magnetic  flux  variation  (V  sec) 


bootstrap  current  (MA) 
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other  hand,  indicates  that  Ignitor  remains,  in  all 
regimes,  within  the  stability  limits  of  the  ideal  MHD 
and  resistive  m  =  1,  n  =  1  modes.  In  addition, 
moderate  amounts  of  auxiliary  heating,  Picrh 
~  10-15  MW,  started  during  the  current  ramp,  allow 
ignition  down  to  the  limits  predicted  by  noXe,  i.e., 
TEs0.4sec,  while  maintaining  very  small  q  =  1 
regions  well  beyond  ignition.  Similarly,  in  the  ohmic 
case,  if  the  requirement  of  small  q  <  1  region  is 
dropped,  either  on  the  basis  of  the  theoretical  anal¬ 
ysis  or  by  externally  stabilizing  the  sawtooth  oscil¬ 
lations,  ignition  can  also  occur  at  this  te  and  at 
times  of  ti  -  5-5.5  seconds. 

The  importance  of  ohmic  heating  during  the  ignition 
sequence  at  high  field  and  density  means  that  a 
model  for  the  electron  thermal  transport  should, 
similar  to  the  one  used  here,  simulate  ohmic 
regimes  and  reproduce  typical  toroidal  loop  volt¬ 
ages  in  steady  state  ohmic  experiments  that  are 
observed  to  be  an  almost  "universal"  constant.  In 
addition,  the  total  diffusion  coefficient  should 
increase  with  injected  heating  and  reproduce  the 
degraded  confinement  observed  in  present  exper¬ 
iments  that  are  dominated  by  injected  heating. 

A  second  major  question  for  ignition  is  the  effect  of 
variation  in  the  plasma  density  and  its  profile,  since 
pellets  injected  to  raise  the  density  are  unlikely  to 
fully  penetrate  a  high  density  plasma.  For  a  given 
level  of  thermal  transport,  there  is  an  optimum 
density  for  fastest  ignition.  A  higher  density  is  more 
favorable  under  degraded  conditions.  Higher 
density,  however,  accelerates  the  toroidal  current 
penetration  at  a  given  time  by  lowering  the  T,, 
producing  larger  q  <  1  regions  earlier  than  at  lower 
density.  This  effect  also  operates  in  the  outer  part 
of  the  plasma  radius  when  density  profiles  are 
broadened.  Thus,  for  0*0=  1-1  x  lO^'m  ®,  profile 
peaking  factors  nj  <n9>  s:  1 .9  where  <n,>  is  the 
volume  average  give  relatively  similiar  results  for 
ignition.  Broader  profiles  rapidly  lead  to  degraded 
ignition,  e.g.,  neo/<n9>  =  1.5  requires  longer  ti  and 
higher  te  and  yields  a  significantly  larger  q  =  1 
radius.  Lower  central  density,  e.g., 
neo=  6.5  X  10“m-3  at  the  end  of  the  current  ramp 
increasing  to  8-9  x  10^  by  ignition,  allows  broader 
profiles,  ngo  /<ng>  =  1 .5.  For  related  reasons, 
increasing  the  plasma  density  after  the  current 
ramp  is  more  advantageous  than  increasing  the 
density  during  the  ramp.  At  high  density,  a  region 
of  low  magnetic  shear  develops  in  the  mid-region  of 
the  minor  radius.  This  region  becomes  seriously 
unstable  when  its  value  of  q  approaches  unity, 
since  ideal  MHD  instabilities  with  m  =  1  can  occur. 
This  is  one  of  the  major  limits  on  the  broad  density 
profile  cases. 


1.4  Versator  II  Tokamak  Research 
Program 

Sponsor 

U.S  Department  of  Energy 

Contract  DE-FG02-91-ER-54109 

Project  Staff 

Professor  Miklos  Porkolab,  Edward  W.  Fitzgerald, 
Jared  P.  Squire,  Jesus  Noel  Villasenor 

Versator  II  is  a  small  tokamak  facility  (major  radius 
R  =  40  cm,  minor  radius  a  =  13  cm)  with  modest 
plasma  parameters  (magnetic  field  Bo  ~  1.3  Tesla, 
density  n* -3 x  10’®  cm  ®,  and  plasma  current 
Ip  ~  1 0-80  kA)  which  is  used  for  fundamental 
studies  of  the  interaction  of  electromagnetic  waves 
with  a  fully  ionized,  nearly  collisionless  plasma.  For 
this  purpose,  we  use  several  high  power  (Prf~  100 
kW)  microwave  sources  to  launch  waves  at  fre¬ 
quencies  near  the  electron-gyro  frequency  (fce- 
28-35  GHz)  and  the  lower  hybrid  (ion-plasma)  fre¬ 
quency  (fo=  800  MHz  or  2.45  GHz).  In  the  sections 
below,  we  describe  two  different  experimental 
projects  which  were  carried  out  during  the  past 
year. 


1.4.1  High  Beta-Poloidal  Experiments  with 
Advanced  X-Ray  Diagnostics 

High  poloidal  beta  (e6~1)  plasma  equilibria  have 
been  produced  by  both  toroidally  asymmetric 
(current  drive,  LHCD)  and  symmetric  (heating,  LHH) 
lower  hybrid  RF  injection  in  the  Versator  ll  tokamak. 
(Here  6p  is  the  ratio  of  the  plasma  pressure  to  the 
poloidal  magnetic  field  pressure,  and  e  -  a/Ro  is  the 
inverse  aspect  ratio).  In  both  cases,  the  plasma 
current  was  fully  sustained  by  the  RF,  with  the  loop 
voltage  negligibly  small  (Vioop<  0.03V).  The 
RF-created  energy  electron  distribution  function, 
which  provides  a  large  fraction  of  the  plasma 
current  and  pressure  in  these  plasmas,  is  studied 
by  means  of  x-ray  spectroscopy  of  the  electron-ion 
bremsstrahlung  plasma  emission.  Profile  measure¬ 
ments  of  the  emission  perpendicular  to  the  toroidal 
magnetic  field  and  emission  measurements  at  a  full 
range  of  angles  to  the  toroidal  magnetic  field  have 
been  carried  out. 

For  low  plasma  current  equilibria,  Pp  is  enhanced, 
and  an  outward  shift  in  major  radius  of  the  x-ray 
emission  profile  peak  was  observed,  corresponding 
to  a  Shafranov  shift  (^a~0.33)  of  the  magnetic 
axis.  The  LHCD  equilibrium  current  profile  was 
determined  from  the  x-ray  emission  profile.  The 
safety  factor  on  axis  is  calculated  as  qo  -  3±1  and  is 
nearly  independent  of  the  total  current.  The  normal- 
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ized  internal  inductance  increases  slightly  with 
lower  plasma  current,  \J2  -  0.8  -►  1 .2  .  From 
Pp  +  li/2  -  4,  the  poloidal  beta  is  calculated,  e^p  >  0.9. 
The  LHCD  x-ray  data  indicate  a  highly  anisotropic 
energetic  electron  distribution  function  with  a 
density  approximately  1  percent  of  the  bulk  electron 
density  at  the  center.  The  stored  energy  of  this  dis¬ 
tribution  is  much  larger  than  the  bulk  energy  and 
the  global  energy  confinement  time  scales  in  agree¬ 
ment  with  the  Kaye-Goldston  L-mode  scaling 

(te  «  IJ2). 

At  high  values  of  Pp  -  3,  LHH  generates  nearly  the 
same  plasma  current  as  LHCD.  The  LHH  equilibria 
has  a  reduction  of  up  to  a  factor  of  four  in  the  high 
energy  (>  30  keV)  x-ray  emission  flux,  as  compared 
to  LHCD.  Modeling  of  the  electron  distribution  func¬ 
tion  in  the  LHH  case  indicates  that  only  one  third  of 
the  current  is  carried  by  the  high  energy  electrons. 
At  lower  values  of  pp~  1.5,  LHH  requires  the 
assistance  of  a  small  applied  loop  voltage 
(Vioop-  0.2V)  to  maintain  the  plasma  current.  The 
high  energy  electrons  carry  a  majority  of  the  plasma 
current.  Soft  x-ray  data  indicates  the  presence  of  an 
enhanced  intermediate  energy  electron  population 
with  a  temperature  of  Tm-IS  keV.  Calculations 
show  that  for  the  high  Pp  -  3  equilibria,  the  interme¬ 
diate  energy  electrons  could  carry  a  majority  of  the 
current  ( -  70%)  in  the  form  of  bootstrap  current. 

Thesis 

Squire,  J.P.  An  Experimental  Study  of  Lower  Hybrid 
Wave-driven  High-Poloidal  Beta  Plasmas  in  the 
Versator  II  Tokamak.  Ph.D.  diss..  Dept,  of 
Physics,  MIT,  1993. 


1.4.2  Fast  Wave  Current  Drive  Experiments 

Previously  we  had  discussed  a  combined  fast 
wave-slow  lower  hybrid  wave  current  drive  exper¬ 
iment  where  two  traveling  waves  are  launched. 
Slow  lower  hybrid  waves  launched  at  2.45  GHz  by 
a  conventional  grill  form  a  suprathermal  electron 
tail,  which  is  then  expected  to  absorb  an  800  MHz 
fast  wave  spectrum  launched  by  a  specially  built 
dielectric- loaded  waveguide  array.  This  is  expected 


to  enhance  the  current  driven  by  the  2.45  GHz  slow 
wave,  and  raise  the  current  drive  efficiency  at  high 
densities  where  slow  wave  current  drive  efficiencies 
are  degraded. 

We  have  built  and  successfully  tested  the  800  MHz 
antenna,  and  showed  that  its  coupling  to  the 
plasma  corresponds  well  with  theory.  However,  high 
power  experiments  (Prt  <  25  kW)  using  the  fast 
wave  antenna  in  conjunction  with  the  2.45  GHz 
launcher  have  failed  to  produce  any  form  of  current 
drive  or  central  heating,  as  shown  by  measure¬ 
ments  using  tangential  and  perpendicular  hard  x-ray 
detectors.  Further,  only  edge  parameters  appear  to 
be  affected  with  the  application  of  RF  power.  Our 
code  calculations  indicate  that  according  to  linear 
wave  theory,  a  significant  fraction  of  the  power 
should  have  penetrated  the  plasma  interior  and  also 
should  have  been  absorbed  by  fast  electrons.  We 
conclude  that  the  fast  waves  must  be  absorbed  on 
the  plasma  surface,  possibly  by  non-linear  effects 
such  as  parametric  decay  or  scattering  by  low  fre¬ 
quency  edge  density  fluctuations. 

Miniature  retarding  potential  energy  analyzer  probes 
have  been  installed  at  different  edge  locations  on 
Versator  II.  A  thin  region  where  electrons  are 
heated  from  5-10  eV  to  as  high  as  100  eV  was 
detected.  This  region  has  a  radial  extent  of  approxi¬ 
mately  1  cm,  and  a  height  of  2  cm,  which  corre¬ 
sponds  to  the  antenna  height.  The  region  of  hot 
electrons  is  located  just  behind  the  limiter  edge  and 
along  the  midplane  of  the  tokamak  (coplanar  with 
the  antenna  array).  This  phenomena  appears  to  be 
well  localized  in  that  region;  no  heating  was  seen  in 
top  or  bottom  probes.  The  heating  also  begins 
when  the  applied  RF  power  exceeds  200  W. 

We  have  also  installed  several  electrostatic  probes 
to  measure  the  parametric  decay  spectra  at  dif¬ 
ferent  toroidal  and  poloidal  locations.  Parametric 
decay  activity  was  clearly  seen  when  the  fast  wave 
power  is  applied.  We  note  asymmetric  broadening 
of  the  pump  frequency  with  more  activity  occuring 
on  the  lower  frequency  side  of  the  pump.  This  phe¬ 
nomena  has  a  measured  threshold  of  Prt  =  200  W, 
which  corresponds  with  the  threshold  of  the  edge 
electron  heating. 
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Polarimetric  radar  calibration  algorithms  using  a 
combination  of  point  targets  and  reciprocal  distrib¬ 
uted  targets  are  developed.  From  a  distributed 
target,  a  matrix  equation  is  derived  which  can  be 
converted  into  an  equivalent  point  target  response. 
The  equivalent  point  target  corresponds  physically 
to  a  90-degree  polarization  rotator  which  is  non- 
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reciprocal  and  rotation-invariant.  Due  to  this 
equivalent-point-target  interpretation,  the  cases  of 
polarimetric  calibration  using  two  point  targets  and 
one  distributed  target  then  reduce  to  those  using 
three  point  targets,  which  have  been  solved  in  pre¬ 
vious  research.  Regarding  the  calibration  using  one 
point  target  and  one  reciprocal  distributed  target, 
two  cases  are  analyzed  with  the  point  target  being 
a  sphere  (triher'  jl  reflector)  or  a  polarimetric  active 
radar  calibrator  (PARC).  For  both  cases,  the 
general  solutions  of  system  distortion  matrices  are 
written  as  a  product  of  a  particular  solution  and  a 
matrix  with  one  free  parameter,  and  then  an  addi¬ 
tional  assumption  about  the  distributed  target  is 
made  to  determine  the  free  parameter.  For  the 
trihedral-reflector  case,  when  the  particular  solution 
is  applied  for  calibration,  the  measured  polarimetric 
data  can  be  calibrated  to  the  level  that  only  rotation 
errors  remain.  When  azimuthal  symmetry  is 
assumed  for  the  distributed  target,  an  iterative 
scheme  is  devised  to  solve  the  rotation  parameter. 
For  the  PARC  case,  the  residual  error  is  similar  to 
the  channel  imbalance  after  the  data  are  calibrated 
by  the  particular  solution,  and  the  free  pQ.'ameter 
can  be  determined  by  knowing  one  ratio  o'  two 
covariance  matrix  elements  of  the  distributed  target. 
Numerical  results  were  simulated  to  demonstrate 
the  validity  of  the  algorithmr  developed. 

A  branching  model  is  proposed  for  the  remote 
sensing  of  vegetation.  The  frequency  and  angular 
responses  of  a  two-scale  cylinder  cluster  are  calcu¬ 
lated  to  demonstrate  the  significance  of  vegetation 
architecture.  The  results  indicate  that  it  is  necessary 
fc"  theoretical  models  to  take  into  account  the  archi¬ 
tecture  of  vegetation  which  plays  an  important  role 
in  determining  the  observed  coherent  effects.  A 
two-scale  branching  model  is  implemented  for 
soybean  with  its  internal  structure  and  the  resulting 
clustering  effects  considered.  Furthermore,  at  the 
scale  of  soybean  fields,  the  relative  location  of 
soybean  plants  is  described  by  a  pair  of  distribution 
functions.  The  polarimetric  backscattering  coeffi¬ 
cients  are  obtained  in  terms  of  the  scattering  prop¬ 
erties  of  soybean  plants  and  the  pair  distribution 
function.  Theoretical  backscattering  coefficients 
evaluated  using  the  hole-correction  pair  distubution 
are  in  good  agreement  with  extensive  data  col¬ 
lected  from  soybean  fields.  Compared  with  the 
independent-scatterer  pair  distribution,  it  is  found 
that  the  hole-correction  approximation,  which  pre¬ 
vents  two  soybean  plants  from  overlapping  each 
other,  is  more  realistic  and  improves  the  agreement 
between  the  model  calculation  and  experimental 
data  near  normal  incidence.  Extension  to  a  multi- 
scale  branching  model  can  be  achieved  by 
recursion. 


=ully  polarimetric  scattering  of  electromagnetic 
waves  from  snow  and  ice  is  studied  with  a  multi¬ 
layered  random  medium  model  and  applied  to  inter¬ 
pret  experimental  data  obtained  under  laboratory 
controlled  conditions  such  as  CRRELEX.  The  snow 
layer  is  modeled  as  an  isotropic  random  medium. 
The  sea  ice  is  described  as  an  anisotropic  random 
medium  due  to  the  nonsphencal  shape  of  brine 
inclusions.  The  underlying  sea  water  is  considered 
as  a  homogeneous  half-space.  The  random  media 
in  both  layers  are  characterized  by  three- 
dimensional  correlation  functions  with  variances  and 
correlation  lengths  corresponding  to  the  fluctuation 
strengths  and  the  physical  geometries  of  the  inho¬ 
mogeneities,  respectively.  The  strong  fluctuation 
theory  is  used  to  calculate  the  effective 
permittivities  of  the  random  media.  The  distorted 
Born  approximation  is  then  employed  to  obtain  the 
covariance  matrix  which  represents  the  fully 
polarimetric  scattering  properties  of  the  snow-ice 
media.  It  has  been  shown  that  the  polarimetric 
covariance  matrix  contains  more  information  than 
the  conventional  scattering  coefficients  on  the 
remotely  sensed  media. 

In  a  saline  ice  sheet  under  quiescent  condition,  the 
background  ice  grows  in  columnar  form  and  saline 
water  is  trapped  between  ice  platelets  in  the  form  of 
brine  inclusions  which  are  usually  ellipsoidal.  The 
ice  tends  to  grow  vertically  downward  rendering  the 
ellipsoidal  inclusions  aligned  preferably  in  the  ver¬ 
tical  direction  and  the  crystallographic  C  axes  par¬ 
allel  to  the  horizontal  plane.  In  this  case,  the  C  axes 
are,  however,  random  in  the  azimuthal  direction. 
The  strong  fluctuation  theory  is  extended  to  account 
for  vertically  aligned  ellipsoidal  brine  inclusions  with 
C  axes  randomly  oriented  in  the  horizontal  direc¬ 
tion.  The  brine  inclusions  are  described  by  three- 
dimensional  local  correlation  functions.  The 
configuration  average  over  the  azimuthal  orientation 
angles  is  carried  out  in  the  process  of  deriving  the 
global  correlation  tensor.  The  distorted  Born 
approximation  is  applied  to  obtain  the  covariance 
matrix  for  the  multilayered  snow-ice  configuration. 
The  theoretical  results  show  non-zero  cross- 
polarized  returns  under  the  first-order  distorted  Born 
approximation.  We  have  also  compared  the  results 
with  experimental  data  obtained  by  the  U.S.  Army 
Cold  Regions  Research  and  Engineering  Labora¬ 
tory  (CRREL). 

A  multivariate  K-distribution  has  been  developed  to 
model  the  statistics  of  fully  polarimetric  radar  data 
from  earth  terrain  with  polarizations  HH,  HV,  VH, 
and  VV.  In  this  approach,  correlated  polarizations  of 
radar  signals,  as  characterized  by  a  covariance 
matrix,  are  treated  as  the  sum  of  N  n-dimensional 
random  vectors;  N  obeys  the  negative  birigmial  dis¬ 
tribution  with  a  parameter  a  and  mean  N.  Subse- 
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quently,  an  n-dimensional  K-distribution,  with  e'ther 
zero  or  nonzero  mean,  is  developed  in  the  limit  of 
infinite  N  or  illuminated  area.  The  probability  density 
function  (PDF)  of  the  K-distributed  vector  normal¬ 
ized  by  its  Euclidean  norm  is  independent  of  the 
parameter  a  and  is  the  same  as  that  derived  from  a 
zero-mean  Gaussian-distributed  random  vector.  The 
above  model  is  well  supported  by  experimental  data 
provided  by  MIT  Lincoln  Laboratory  and  the  Jet 
Propulsion  Laboratory  in  the  form  of  polarimetric 
measurements.  The  results  are  illustrated  by  com¬ 
paring  the  higher-order  normalized  intensity 
moments  and  cumulative  density  functions  (CDF)  of 
the  experimental  data  with  theoretical  results  of  the 
K-distribution. 

Among  the  various  theoretical  models  applied  to 
study  the  electromagnetic  wave  scatterings  from 
geophysical  terrain,  such  as  snow  and  ice,  the 
radiative  transfer  theory  has  drawn  intensive  atten¬ 
tion  in  the  microwave  remoie  sensing  community 
during  the  past  years.  In  most  of  the  scattering 
models,  the  volume  scattering  and  ‘he  surface  scat¬ 
tering  effects  have  been  investigated  separately. 
Recently,  there  has  been  a  growing  interest  in  the 
construction  of  composite  models  which  can  take 
into  account  both  types  of  scattering.  We  derived 
the  first  order  iterative  solution  to  the  vector 
radiative  transfer  equations  for  a  two-layer  medium 
with  a  diffuse  top  boundary  and  an  irregular  bottom 
boundary  of  Gaussian  roughness.  The  Kirchhoff 
approximation  and  the  geometrical  optics  approach 
with  shadowing  correction  are  used  in  formulating 
the  boundary  conditions.  To  demonstrate  the  utili¬ 
ties  of  the  theory,  randomly  oriented  spheroidal  dis¬ 
crete  scatte  er  model  is  used  to  calculate  the 
backscattering  coefficients  from  a  soybean  field  in 
different  growing  stages  and  then  compared  to  the 
experimental  measurements.  Good  agreement  has 
been  achieved  for  both  the  co-polarized  and  the 
cross-polarized  data.  It  is  observed  that  the  pres¬ 
ence  of  the  rough  surface  can  significantly  enhance 
the  backscattering  at  small  incident  angles  and 
increase  the  cross-polarized  returns.  The  polariza¬ 
tion  signatures  calculated  based  on  the  Mueller 
matrix  show  a  straight  distortion  track  and  an 
observable  pedestal.  Numerical  comparison  to  the 
backscattering  coefficients  calculated  by  using 
planar  bottom  boundary  conditions  with  or  without 
the  incoherent  addition  of  the  rough  surface  effects 
are  also  made. 

The  concept  of  polarimetry  in  active  remote  sensing 
is  extended  to  passive  remote  sensing.  The  poten¬ 
tial  use  of  the  third  and  fourth  Stokes  parameters  U 
and  V,  which  play  an  important  role  in  polarimetric 
active  remote  sensing,  is  demonstrated  for  passive 
remote  sensing.  It  is  shown  that,  by  the  use  of  the 


reciprocity  principle,  the  polarimetric  parameters  of 
passive  remote  sensing  can  be  obtained  through 
the  solution  of  the  associated  direct  scattering 
problem.  In  particular,  the  full  polarimetric  informa¬ 
tion,  including  the  corresponding  brightness  temper¬ 
atures  of  U  and  V,  can  be  obtained  from  the 
solution  of  the  direct  scattering  problem  for  four  dif¬ 
ferent  polarizations  of  the  incident  wave.  These 
ideas  are  applied  to  study  polarimetric  passive 
remote  sensing  of  periodic  surfaces.  The  solution  of 
the  direct  scattc'-ng  problem  is  obtained  by  an  inte¬ 
gral  equation  formulation.  Incidence  on  a 
penetrable,  lossy  medium  is  considered.  Since  the 
kernels  of  the  integral  equations  are  the  periodic 
Green's  functions  and  I'leir  normal  derivatives  on 
the  surface,  rapid  evaluation  of  the  slowly  conver¬ 
gent  series  associated  with  these  functions  is  crit¬ 
ical  for  the  method  to  be  feasible.  The  study  has 
shown  that  the  brightness  temperature  of  the 
Stokes  parameter  U  can  be  significant  in  passive 
remote  sensing.  Values  as  high  as  50  K  are 
obsen/ed  for  certain  configurations. 

To  demonstrate  the  use  of  polarimetry  in  passive 
remote  sensing  of  azimuthally  asymmetric  features 
on  a  terrain  surface,  an  experiment  was  designed 
and  implemented.  A  triangular  corrugation  pattern 
was  made  on  the  sandy  soil  surface.  Polarimetric 
brightness  temperatures  are  measured  with  hori¬ 
zontal,  vertical,  and  45°  polarization  orientations  for 
various  observation  angles.  From  the  measured 
temperatures,  absolute  values  as  high  as  30-40  K 
of  the  third  Stokes  brightness  temperatures  are 
observed.  A  theoretical  analysis  of  the  data  indi¬ 
cates  that  the  high  values  of  U  are  caused  by  the 
azimuthal  asymmetry  on  the  remotely  sensed  soil 
surface.  It  is  also  observed  from  the  experiment 
that  the  brightness  temperatures  for  all  three  Stokes 
parameters  vary  as  the  observation  direction  varies 
from  being  parallel  to  the  surface  row  structure  to 
being  perpendicular  to  the  row  structure.  The  signif¬ 
icant  implies*' jn  of  this  experiment  is  that  the 
surface  asymmetry  can  be  detected  with  a  meas¬ 
urement  of  U  at  a  single  azimuthal  angle. 
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Recent  studies  on  electromagnetic  emissions  have 
identified  the  significance  of  resonance  in  the  radi¬ 
ation  and  re-radiation  of  printed  wiring  board  (PWB) 
components  such  as  transmission  lines  and 
heatsinks.  This  has  drawn  attention  to  other  com¬ 
ponents  within  a  computer  system  which  are  of 
larger  dimensions  and  which  are  likely  candidates 
to  resonate  within  the  frequency  range  of  emissions 
tests.  An  arrangement  in  computer  systems  which 
falls  into  this  category  is  the  modules-on-backplane 
configuration.  Discussions  with  DEC  engineers 
have  identified  the  voltage  fluctuations  across  the 
relatively  high  impedance  reference  connections, 
between  a  reference  plane  within  a  module  and  that 
on  the  backplane,  as  a  likely  energy  source  for 
electromagnetic  emissions.  The  array  airangement 
of  multiple  modules  on  a  backplane  raises  the  pos¬ 
sibility  of  parallel  plate  resonances  where  the  reso¬ 
nance  frequencies,  to  leading  order,  are  inversely 
proportional  to  the  dimensions  of  the  module 
planes.  The  large  dimensions  of  the  module  planes 
relative  to  the  signal  etch  runs,  heatsink  dimen¬ 
sions,  etc.,  imply  that  any  resonance  effects  may  be 
manifested  at  lower  frequencies. 

There  is  common  perception  that  in  the  printed 
wiring  board  (PWB)  environment  stripline  configura¬ 
tions  will  generate  lower  emission  levels  than  micro¬ 
strip  structures.  In  previous  years,  the  radiation 
properties  of  both  microstrip  and  stripline  structures 
of  finite  size  were  quantitatively  compared.  The 
examination  of  conducting  structures  which  may 
model  shielding  enclosures  on  the  radiating  proper¬ 
ties  of  the  two  structures  is  of  interest,  as  these 
configurations  correspond  to  the  case  where 
modules  are  placed  in  close  proximity  to  the 
shielding  enclosure  common  in  computer  systems. 

Metallic  enclosures  are  commonly  used  to  house 
computer  systems.  The  electromagnetic  field  dis¬ 
tribution  and  intensities  which  result  from  sources 
within  the  enclosure  are  of  concern  because  of  the 
possibility  of  strong  fields  existing  due  to  the  reson¬ 
ator  structure  formed  by  the  metallic  enclosure. 
The  effects  of  adding  resistive  material  of  varying 
dimensions,  resistivities,  and  positions  to  the  enclo¬ 
sure  is  also  examined.  The  study  will  attempt  to 


determine  the  optimum  parameters  for  the  resistive 
material  which  will  minimize  the  electromagnetic 
field  intensity  within  the  enclosure.  Perforations 
(holes,  slots)  in  the  metallic  enclosure  and  the  cor¬ 
responding  radiated  power  are  also  considered. 

For  each  configuration,  field  strengths,  radiated 
power  levels,  and  resonance  frequencies  (which  are 
crucial  for  the  radiation  mechanism)  are  being 
investigated.  This  study  will  benefit  the  design  of 
integrated  circuit  interconnections  to  meet  regula¬ 
tory  limits  on  electromagnetic  emission  levels.  It 
reflects  the  progression  from  examining  module 
interconnect  and  component  radiation  in  relative 
isolation  (performed  in  the  last  two  contract  years) 
to  more  realistic  situations  which  incorporate  inter¬ 
action  with  other  conducting  structures  in  close 
proximity,  as  is  the  case  at  the  system  level. 

The  finite  difference-time  domain  (FD-TD)  technique 
is  applied  to  the  solution  of  Maxwell's  equations.  A 
computer  program,  which  can  be  used  to  simulate 
and  study  numerous  electromagnetic  phenomena, 
is  developed  and  implemented  on  an  IBM 
386-compatible  personal  computer.  The  FD-TD 
technique  is  a  useful  tool  for  students  in  electro¬ 
magnetics.  The  technique  is  flexible  and  can  be 
applied  to  many  basic  EM  scattering  and  radiation 
problems.  Because  field  solutions  are  found  as  a 
function  of  time,  visualization  of  the  propagation  of 
the  EM  fields  is  possible.  The  FD-TD  technique  is 
implemented  for  a  two-dimensional  rectangular  grid 
in  conjunction  with  a  second-order  absorbing 
boundary  condition.  Both  E-  and  H-field  polariza¬ 
tions  are  analyzed.  Finite  objects  consisting  of 
dielectric,  magnetic  and  conducting  materials,  and 
perfectly  conducting  infinite  ground  planes  are 
modeled.  Plane  wave  and  line  current  sources  are 
implemented.  In  addition  to  the  capability  of  ani¬ 
mating  the  propagation  of  the  EM  fields,  radiation 
and  scattering  patterns  can  be  generated. 

A  methodology  developed  to  handle  dispersive 
materials  in  the  time  domain  is  extended  to  model 
the  dispersive  characteristics  of  the  impedance 
boundary  condition  used  for  a  thin  layer  coating 
over  perfect  conductors.  The  impedance  boundary 
condition  is  first  approximated  as  a  rational  function 
of  frequency.  This  rational  function  is  then  trans¬ 
formed  to  a  time  domain  equation,  resulting  in  a 
partial  differential  equation  in  space  and  time. 
Discretization  of  the  time  domain  model  to  effi¬ 
ciently  handle  the  thin  layer  coating  is  presented  in 
the  context  of  the  finite-difference  time-domain 
(FD-TD)  technique.  The  methodology  is  verified  by 
solving  a  one-dimens  o"  h  problem  using  the  FD-TD 
technique  and  comp  ?'  •  ;;  it  with  the  analytical 
results. 
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Precision  landing  system  plays  an  important  role  in 
continuing  air  travel  services  during  adverse 
weather  conditions.  Instrument  Landing  System 
(ILS)  has  been  in  service  for  the  past  five  decades, 
and  microwave  landing  system  (MLS)  will  be  gradu¬ 
ally  replacing  ILS  within  the  next  decade.  These 
two  systems  require  ground  equipments  to  work 
properly;  therefore,  their  usage  is  limited  to  where 
such  setup  is  available.  Two  alternative  technolo¬ 
gies  that  do  not  require  ground  setups  have  been 
proposed  as  the  future  replacement  for  ILS  and 
MLS:  the  synthetic  vision  (SV)  sensors  and  the 
global  positioning  system  (GPS). 

In  this  project  we  have  developed  a  computer  simu¬ 
lation  tool  EMSALS  which  uses  the  databases  of 
navigational  and  commercial  radiowave  transmitters 
to  calculate  the  strengths  of  desired  and  undesired 
signals,  and  generates  an  EM  interference  assess¬ 
ment  based  on  user-selected  assumptions.  The 
first  part  of  the  software  EMSALS/I  (for  ILS)  has 
been  used  to  analyze  the  frequency  congestion  and 
electromagnetic  interference  problems  in  the  conti¬ 
nental  United  States  given  the  assumptions  of  the 
current  FAA  standard  interference  protections. 
Emphasis  was  first  placed  on  ten  complex  metro¬ 
politan  areas  with  high  densities  of  runways.  Then 
the  less  densely  populated  regions  were  analyzed 
to  determine  ILS  capacity  across  the  entire  conti¬ 
nental  United  States  (CONUS). 

The  second  part  of  the  software,  EMSALS/M  (for 
MLS),  was  used  to  evaluate  channel  availability  and 
to  generate  a  frequency  assignment  plan  given  the 
assumptions  of  the  current  ICAO  MLS  channel 
assignment  rules  and  interference  protection  stan¬ 
dards.  The  channel  assignments  had  to  be  carried 
out  for  ILS-to-MLS  conversion  sites  as  well  as  new 
candidates,  including  heliports  which  cannot  be 
sited  with  ILS.  The  total  number  of  candidate  sites 
is  over  1800. 


The  instrument  landing  system  (ILS)  is  the  standard 
precision  landing  aid  used  by  thousands  of  airline 
carriers  and  general  aviation  aircraft.  In  recent 
years,  there  have  been  growing  concerns  over 
potential  hazzards  from  FM-broadcast  interference. 
The  ILS  localizer,  which  provides  lateral  guidance 
during  the  aircraft  landing  phase,  has  its  frequency 
band  (108-112  MHz)  located  right  above  the  com¬ 
mercial  FM-broadcasting  stations  (88-108  MHz). 
Because  the  effective  radiated  power  of  a  localizer 
transmitter  is  typically  around  15  W,  as  compared 
to  10-100  kW  for  an  FM  station,  it  is  possible  for 
FM  signals  to  overload  the  airborne  receiver 
front-end  or  to  drive  it  out  of  specifications  and 
cause  intermodulation  interference. 

We  developed  a  scheme  to  link  the  interference 
immunity  with  the  risk  allowance  for  instrument 
landing  operations.  The  risk  allowance  is  derived 
as  a  function  of  the  aircraft  position.  By  the  very 
nature  of  landing,  the  allowable  risk  from  interfer¬ 
ence  decreases  or  becomes  more  stringent  as  the 
aircraft  approach  the  decision  height.  We  also 
establish  the  composite  probability  of  interference 
from  the  statistical  models  of  systems  involved, 
including  the  localizer  transmitter  power,  FM  station 
power,  propagation  factors,  antenna  directivity,  and 
variations  in  airborne  receivers.  By  comparing  the 
composite  probability  of  interference  with  the  risk 
allowance  at  each  point  within  the  service  volume 
of  ILS  localizer,  we  can  then  determine  whether 
there  is  harmful  interference  from  the  FM  stations. 
A  computer  program  has  been  developed  for  the 
model. 

Electromagnetic  interference  is  one  of  the  factors 
that  can  adversely  affect  GPS  receiver  functioning 
during  the  critical  phase  of  final  approach  and 
landing.  Although  no  actual  incidence  has  been 
reported,  in  theory  the  threat  from  a  moderately  low 
power  interference  source  does  exist.  In  this  study, 
we  conducted  experimental  investigation  of  SPS 
GPS  receiver  susceptibility  to  electromagnetic  inter¬ 
ference.  An  aviation-grade  GPS  receiver  mounted 
on  the  roof  of  a  six-story  building  was  subject  to 
on-channel  and  off-channel  CW  and  narrow-band 
noise  with  varying  power  levels.  The  interfering 
signal  was  transmitted  over  the  open  space  to 
reach  GPS  antenna.  We  used  a  combination  of 
IBM  PC  and  receiver  front-panel  display  to  contin¬ 
uously  record  GPS  data  and  warning  flag  for  the 
cases  with  and  without  jammers.  Taking  the 
receiver  front-end  as  the  reference  point,  we 
obtained  the  interference  power  level  that  causes 
the  GPS  receiver  to  lose  lock  and  become  unable 
to  provide  accurate  position  data.  This  level  is  then 
converted  to  antenna  receiving  power,  from  which 
we  estimate  the  effective  radiation  power  level 
required  for  interferers  at  a  distance. 
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Kohler,  T.  Nguyen,  H.  Nguyen,  and  J.  Ho. 
"Applications  of  EMSARS  Background  Model  to 
MMW  Vegetation  Scattering  Simulation."  Paper 
presented  at  the  Ground  Target  Modeiing  and 
Validation  (GTMV)  Conference,  Marquette, 
Michigan,  August  17-19, 1992. 

Kong,  J.A.  "Progress  in  Electromagnetics  and 
Future  Developments."  Paper  presented  at  the 
International  Electromagnetic  Compatibility  Sym¬ 
posium,  Singapore,  December  7-9,  1992. 

Kong,  J.A.  "Theoretical  Modeling  for  Passive 
Microwave  Remote  Sensing  of  Earth  Terrain." 
Paper  presented  at  the  URSI  Specialists 
Meeting  on  Microwave  Radiometry  and  Remote 
Sensing,  Boulder,  Colorado,  January  24-26, 
1992. 

Kong,  J.A.  "Characterization  of  Earth  Terrain  Mate¬ 
rial  as  Random  Media  Applied  to  Remote 
Sensing."  Paper  presented  at  the  International 
Electromagnetic  Compatibility  Symposium, 
Singapore,  December  7-9,  1992. 

Nghiem,  S.V.,  R.  Kwok,  J.A.  Kong,  R.T.  Shin,  A.J. 
Gow,  and  S.A.  Arcone.  "Effective  Permittivity  of 
Saline  Ice  Under  Thermal  Variation."  Paper  pre¬ 
sented  at  the  International  Geoscience  and 
Remote  Sensing  Symposium,  Houston,  Texas, 
May  26-29,  1992. 

Xia,  J.,  and  J.A.  Kong.  "Electromagnetic  Profile 
Reconstruction  using  the  Riccati  Equation 
Approach."  Paper  presented  at  the  IEEE  APS 
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International  Symposium,  Chicago,  Illinois,  July 
18-25,  1992. 

Xia,  J.,  T.M.  Habashy,  R.T.  Shin,  and  J.A.  Kong. 
"The  Renormalized  STIE  Aproach  Applied  to 
Inversion  of  Soil  Moisture  Profiles."  Paper  pre¬ 
sented  at  the  National  Radio  Science  Meeting, 
Boulder,  Colorado,  January  7-10,  1992. 

Xia,  J.,  and  J.A.  Kong.  "Electromagnetic  Inverse 
Scattering  in  Remote  Sensing."  Paper  presented 
at  the  Optical  Society  of  America  Topical 
Meeting  on  Signal  Recovery  and  Synthesis, 
New  Orleans,  Louisiana,  April  14-15,  1992. 

1.5  Superconducting  Transmission 
Lines 

1.5.1  Simulations  of  Vortices  in  Arrays  of 
Josephson  Junctions 

Sponsors 

DARPA/Consortium  for  Superconducting 
Electronics 

Contract  MDA972-90-C-0021 
National  Science  Foundation 
Fellowship  MIP  88-58764 

Project  Staff 

Joel  R.  Phillips,  Herre  S.J.  van  der  Zant,  Robert 
Bock,  Professor  Terry  P.  Orlando,  Professor  Jacob 
K.  White 

Vortices  play  a  central  role  in  determining  the  static 
and  dynamic  properties  of  two-dimensional  (2D) 
superconductors.  Artificially  fabricated  networks  of 
superconducting  islands  weakly  coupled  by 
Josephson  junctions  are  model  systems  for 
studying  the  behavior  of  vortices.  These  arrays 
have  also  been  used  to  study  the  superconducting- 
insulator  transition.  Giant  Shapiro  steps,  and  the 
Kosterlitz-Thouless-Berezinskii  (KTB)  transition. 

Studies  of  vortices  in  Josephson  junction  arrays 
generally  neglect  magnetic  fields  induced  by  cur¬ 
rents  flowing  in  the  array.  It  is  assumed  that  the 
penetration  depth  for  flux  X  is  much  larger  than  the 
size  of  the  array.  With  the  present  SNAP  tech¬ 
nology,  all  niobium  arrays  have  been  made  with  X 
of  the  order  of  the  cell  size;  therefore,  effect  of 
induced  fields  must  be  considered  for  an  accurate 
description  of  these  systems. 

We  use  numerical  simulation  to  investigate  how  a 
variety  of  vortex  static  properties  are  affected  by 


finite  penetration  depth  X,  and  we  calculate  for  the 
first  time  the  self-consistent  current  and  magnetic 
fields  from  a  vortex  in  a  2D  array.  We  find  that  in 
order  to  calculate  the  correct  current  and  field  dis¬ 
tributions,  the  full  3D  behavior  of  the  magnetic  fields 
must  be  accounted  for  by  including  nearly  all 
mutual  inductance  terms.  However,  to  calculate  the 
energy  barrier  for  cell  to  cell  vortex  motion,  which 
was  first  shown  by  Lobb,  Abraham,  and  Tinkham 
(LAT)  to  be  0.2  Ej,  where  Ej  is  the  Josephson  cou¬ 
pling  energy,  including  only  self  and  nearest 
neighbor  inductances  is  sufficient.  The  LAT  calcu¬ 
lation  neglected  induced  magnetic  fields.  We  show 
that  induced  fields  may  increase  the  energy  barrier 
substantially  above  0.2  Ej.  Our  calculations  also 
show  that  the  thermodynamic  lower  critical  field  of 
the  array  is  enhanced  when  the  computation  self- 
consistently  accounts  for  induced  magnetic  fields. 
By  using  only  a  self-inductance  term  to  model  the 
induced  fields,  the  lower  critical  field  is  overesti¬ 
mated. 

Self-consistently  determining  the  currents  and  fields 
in  a  Josephson  junction  array  is  a  difficult  numerical 
problem.  This  is  because  the  fields  induced  by  a 
junction  current  affect  the  current  through  every 
junction  in  the  array.  This  dense  interaction  implies 
that  for  an  array  having  N  cells,  on  the  order  of  N^ 
words  of  computer  memory  are  required  just  to 
store  the  mutual  inductance  matrix.  If  a  direct 
approach  is  used  to  compute  the  self-consistent 
solution  by  factoring  the  inductance  matrix,  on  the 
order  of  N^  operations  are  required.  For  example, 
the  inductance  matrix  of  a  100  x  100  array  requires 
more  than  400  megabytes  of  memory  to  store  and 
on  the  order  of  10’^  operations  to  factor. 

To  make  the  computation  tractable,  we  derive  a 
novel  simulation  algorithm  which  combines  several 
numerical  techniques  with  an  appropriate  problem 
formulation.  When  used  to  simulate  an  N-cell  array, 
this  approach  reduces  the  storage  required  to  order 
N  and  decrease  the  computation  time  to  order  N 
log  N.  This  approach  makes  it  possible  to  compute 
self-consistent  array  currents  and  fields  in  a  100  x 
100  array  in  a  minute  or  so  using  a  scientific  work¬ 
station  (IBM  RS6000). 


1.5.2  NonLinear  Effects  in 
Superconducting  Transmission  Lines 

Sponsors 

DARPA/Consortium  for  Superconducting 
Electronics 

Contract  MDA972-90-C-0021 
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Project  Staff 

Cheung-Wei  Lam,  Dr.  Sami  M.  Ali,  and  Professor 
Terry  P.  Orlando  in  collaboration  with  Daniel  E. 
Oates® 

The  Ginzburg-Landau  theory  has  been  applied  to 
model  the  nonlinear  behavior  of  superconducting 
transmission  lines,  including  striplines  and  micro¬ 
strip  lines. 

The  Ginzburg-Landau  differential  equations  are 
solved  numerically  inside  the  signal  line  where  high 
current  density  exists.  The  resulting  distribution  of 
the  magnetic  penetration  depth  and  conductivity  is 
then  used  to  calculate  the  nonlinear  inductance  and 
resistance  of  the  superconducting  transmission 
lines.  Numerical  results  are  compared  with 
measurements  on  NbN  and  YBCO  stripline  reson¬ 
ators.  Good  agreement  is  obtained  for  the  non¬ 
linear  shift  of  resonant  frequency.  However,  the 
measured  Q  factor  appears  to  be  much  more  non¬ 
linear  than  predicted  by  the  numerical  model.  It  is 
believed  that  some  other  nonlinear  loss  mechanism 
is  dominating  over  that  caused  by  pair-breaking. 

This  observation  motivates  us  to  investigate  the 
nonlinear  loss  caused  by  vortex  motions.  A  non¬ 
linear  diffusion  equation  for  the  magnetic  flux  is  for¬ 
mulated  for  the  flux  flow  dynamics.  Appropriate 
models  for  the  nonlinear  diffusion  coefficient  will  be 
adopted  for  the  numerical  calculation  of  flux  flow  in 
a  Superconducting  slab. 


1.5.3  Superconducting  Transmission  Lines 
Sponsor 

DARPA/Consortium  for  Superconducting 
Electronics 

Contract  MDA972-90-C-0021 

Project  Staff 

Laurence  H.  Lee,  Dr.  Sami  M.  Ali,  Professor  Terry 
P.  Orlando  in  collaboration  with  W.  Gregory  Lyons® 

A  full-wave  spectral-domain  volume-integrai- 
equation  method  has  been  developed  to  analyze 
various  configurations  of  superconducting  trans¬ 
mission  lines;  microstrip  lines,  striplines,  coplanar 
strips,  and  coplanar  waveguides.  In  the  formu¬ 
lation,  a  spectral-domain  Green's  function  for 
isotropic,  layered  media  is  used  to  set  up  an  inte¬ 


gral  equation  for  the  electric  field  inside  the  super¬ 
conducting  strips.  Galerkin's  method  with  roof-top 
basis  functions  is  employed  to  solve  for  the 
complex  propagation  constant  and  current  distribu¬ 
tion.  The  characteristic  impedance  of  the  structures 
is  then  obtained  from  transmission  line  theory.  This 
method  rigorously  accounts  for  the  anisotropy  and 
the  finite  thickness  of  the  superconducting  films, 
yielding  accurate  characterization  for  the  loss  and 
kinetic  effect  of  the  superconductors.  However,  this 
technique  is  computationally  inefficient. 

To  implement  an  efficient  method,  an  equivalent 
surface  impedance  is  used  to  transform  the  super¬ 
conducting  strip  with  finite  thickness  to  an  infinitely 
thin  strip.  This  equivalent  surface  impedance 
accounts  for  the  loss  and  kinetic  inductance  of  the 
superconductors.  An  empirical  formula  for  the 
current  distribution  in  a  thin  superconducting  film  is 
determined  for  the  derivation  of  the  equivalent 
surface  impedance.  To  include  effects  of  aniso¬ 
tropic  substrates,  a  2D  dyadic  Green's  function  for 
anisotropic,  layered  media  is  used  to  formulate  an 
integral  equation  for  the  surface  current.  Galerkin's 
method  with  entire-domain  basis  functions  is  used 
to  solve  for  the  complex  propagation  constant  and 
the  surface  current.  The  characteristic  impedance 
is  then  calculated  using  the  power-current  definition. 
This  method  has  been  used  to  analyze  supercon¬ 
ducting  single  and  coupled  microstrip  lines  on 
anisotropic  substrates. 

We  have  been  successfully  measuring  the  long 
wavelength  infrared  quantum  well  (LWIR)  intersub¬ 
band  absorption  in  2  to  5  txm  wavelength  region 
based  on  InAs/InGaAs/AIAs  resonant  tunneling 
diode  structures  for  the  IR  detectors  application. 
Over  40  percent  intersubband  absorption  can  be 
achieved  from  a  doped  single  quantum  well  through 
our  delicated  designed  waveguiding  system,  where 
IR  is  focused  upon  the  sample  edge  and  makes  20 
times  total  internal  reflection  while  it  propagates 
along  the  samples  and  dramatically  increases  the 
absorption  strength.  We  also  demonstrate  both  the¬ 
oretically  and  experimentally  that  TE  as  well  as  TM 
modes  can  excite  the  intersubband  absorption.  We 
also  pointed  out  the  interface  Fe  interband  absorp¬ 
tion  at  0.3  eV  could  seriously  obscure  the  IR 
absorption  spectra  and  have  shown  how  they  can 
be  avoided.  These  results,  applied  in  concert  with 
recently  developed  selective  etches  that  allow  us  to 
make  electrical  contact  to  the  quantum  well  directly 
and  open  the  way  to  realizing  high  performance 
LWIR  detectors. 


8  MIT  Lincoln  Laboratory. 
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Professor  Bernard  F.  Burke,  Professor  David  H.  Staelin,  Professor  Jacqueline  N.  Hewitt,  Dr.  Philip  W. 
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Visiting  Scientists  and  Research  Affiiiates 
Dr.  Alan  Wright' 

Graduate  Students 

Ashraf  S.  Alkhairy,  Ian  M.  Avruch,  Deborah  J.  Becker,  Carlos  R.  Cabrera,  Grace  H.  Chen,  Kevin  G. 
Christian,  Samuel  R.  Conner,  Sheperd  S.  Doeleman,  John  D.  Ellithorpe,  Paul  W.  Fieguth,  Andr6  B.  Fletcher, 
Matthew  K.  Gray,  Mark  R.  Griffith,  Lori  K.  Herold,  Charles  A.  Katz,  Pauline  McMahon,  Mark  D.  Messier, 
Christopher  B.  Moore,  Michael  J.  Schwartz,  Jie  Sun 
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1.1  Extragalactic  Radio  Source  Studies 

Sponsor 

National  Science  Foundation 
Grant  AST  90-22501 

Project  Staff 

Professor  Bernard  F.  Burke,  Jacqueline  N.  Hewitt, 
John  W.  Barrett,  Ian  M.  Avruch,  Deborah  J.  Becker, 
Samuel  R.  Conner,  Sheperd  S.  Doeleman,  Andr^ 
B.  Fletcher,  Mark  R.  Griffith,  Lori  K.  Herold, 

Three  major  areas  of  research  were  covered  by  the 
program  under  Professor  Burke's  direction.  Two  of 
these  were  in  the  field  of  radio  astronomy:  the 
search  for  gravitational  lenses  (two  new  examples 
have  been  discovered  by  our  group)  and  a  sunrey 
of  radio  sources  in  the  Southern  Hemisphere  that 
increased  the  number  of  known  radio  sources  there 
by  nearly  an  order  of  magnitude.  The  third  is  the 
opening  of  a  new  field  of  research:  the  search  for 
and  study  of  planets  of  other  stars  known  as 
"exoplanets." 


1.1.1  Gravitational  Lens  Studies 

The  study  of  the  gravitational  lensing  phenomenon, 
which  occurs  when  a  distant  object  such  as  a 
quasar  has  its  image  distorted  by  a  foreground 
mass  such  as  a  galaxy,  presents  a  new  tool  for 
investigating  gravitational  interaction  and 
cosmology.  In  particular,  the  distribution  of  all 
matter  in  a  galaxy  can  be  studied,  whether  it  is 
luminous  or  dark,  baryonic  or  non-baryonic.  Two 
new  examples  of  gravitational  lensing  were  discov¬ 
ered  during  the  past  two  years  of  the  program.  In 
the  course  of  this  work,  it  has  become  clear  that 
there  are  systematic  classes  of  lensing  which  may 
be  called  "Einstein  rings"  and  "Einstein  quads." 
(The  first  example  of  an  Einstein  ring  was  discov¬ 
ered  earlier  by  Professor  Jacqueline  Hewitt  of  the 
MIT  Physics  Department  and  RLE.) 

The  work  uses  the  MIT-Green  Bank  (MG)  Survey 
as  a  finding  list,  mapping  a  large  number  of  these 
radio  sources  at  high  angular  resolution  with  the 
Very  Large  Array  (VLA)  and  then  pursuing  prom¬ 
ising  examples  with  further  optical  and  radio 
studies.  In  our  program,  over  6000  radio  sources 
have  been  mapped  with  the  VLA.  This  collection  is 
the  largest  sample  of  sub-arcsecond  radio  source 
maps  in  the  world,  and  it  can  be  used  for  numerous 
astronomical  investigations. 


'  Australian  National  Telescope  Facility,  Parkes,  Australia. 
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Space  Res.  1 1  (2) :  349-353  (1991). 

Burke,  B.F.,  D.H.  Roberts,  J.  Lehar,  and  J.N. 
Hewitt.  "The  Hubble  Constant  from  VLA  Meas¬ 
urement  of  the  Time  Delay  in  the  Double 
Quasar  0957+561."  Nature.  July  1991. 

Burke,  B.F.,  M.B.  Heflin,  M.V.  Gorenstein,  and  C.R. 
Lawrence.  "First  and  Second  Epoch  VLBI 
Observations  of  the  Gravitational  Lens  System 
2016+112."  Astrophys.J.  378:519-536  (1991). 

Burke,  B.F.  "Summary  of  the  Issues."  In  Frontiers 
of  VLBI:  Proceedings  of  the  International  VSOP 
Symposium  and  the  mm-Wave  VLBI  Workshop, 
Eds.  H.  Hirabyashi,  M.  Inoue,  and  H.  Kobayashi. 
Chapter  5:  Management  Plan.  Tokyo:  University 
Academy  Press,  1991. 

Burke,  B.F.  "A  View  of  the  Universe  Through 
Imperfect  Lenses:  A  Tribute  to  David 

Heeschen."  Talk  presented  at  the  Heeschen 
Retirement  party,  December  16,  1991. 

Burke,  B.F.,  J.  Leh^r,  and  S.  Conner.  "Einstein 
Rings  and  Related  Phenomena."  Paper  pre¬ 
sented  at  the  International  Conference  on  Gra¬ 
vitational  Lenses,  Hamburg,  Germany, 

September  1991. 

Burke,  B.F.,  J.  Leh^ir,  J.N.  Hewitt,  and  D.H. 
Roberts.  "The  Radio  Time  Delay  in  the  Double 
Quasar  0957+561."  Astrophys.  J.  384:453-466 
(1992). 

Burke,  B.F.,  S.R.  Conner,  and  J.  Leh^r.  "Recon¬ 
ciling  the  Image  Brightness  Ratios  in  the  Gra¬ 
vitational  Lens  System  0957+561."  Astrophys. 
J.  Lett.  387  LC:  61-64  (1992). 

Griffith,  M.,  G.  Langston,  M.  Heflin,  and  S.  Conner. 
"The  Fourth  MIT-Green  Bank  5  GHz  Survey." 
Astrophys.  J.  Suppl.  75:  801-833  (1991). 

Invited  Talks 

Burke,  B.F.  "Gravitational  Lenses."  Arecibo  Radio 
Observatory,  Puerto  Rico,  March  1992. 

Burke,  B.F.  "Gravitational  Lenses  in  the  Ultravi¬ 
olet."  University  College,  London,  June  1992. 

Burke,  B.F.  "Einstein  Rings  and  Einstein  Quads." 
lAU  Colloquium,  Manchester,  England,  July 
1992. 


1.1.2  The  ParkeS'MIT-NRAO  Southern 
Hemisphere  Survey 

The  Parkes-MIT-NRAO  survey  of  radio  sources  was 
concluded.  The  work  was  carried  out  with  the 
Parkes  210-foot  radio  telescope  using  the  National 
Radio  Astronomy  Observatory  (NRAO)  14-channel 
6  cm  receiver  by  Dr.  Alan  Wright  of  Australia  and 
Mr.  Mark  Griffith,  an  MIT  graduate  student.  We 
have  published  a  first  general  overview  paper,  and 
a  detailed  technique  paper  will  be  published  in  May 
1993. 

The  detailed  catalogue  is  ready  for  submission  and 
contains  a  list  of  approximately  60,000  radio 
sources  in  the  southern  sky.  This  is  an  increase  by 
nearly  an  order  of  magnitude  in  the  number  of 
known  Southern  Hemisphere  radio  sources,  and 
follow-up  work  is  continuing  with  our  Australian  col¬ 
leagues. 


1.2  The  Study  of  Exopianets 

Sponsor 

National  Aeronautics  and  Space  Administration 
Grant  NAGW  1386 

Project  Staff 

Professor  Bernard  F.  Burke,  Ian  M.  Avruch, 
Deborah  J.  Becker,  Samuel  R.  Conner,  Andr6  B. 
Fletcher,  Mark  R.  Griffith,  Lori  K.  Herold 

The  search  for  and  study  of  exoplanets,  including 
the  study  of  their  origin  and  evolution,  is  a  new 
research  area  that  has  become  a  NASA  program 
known  as  the  Toward  Other  Planetary  Systems 
(TOPS)  program.  In  1988,  Professor  Burke  was 
appointed  Chair  of  the  NASA  science  working 
group  that  formulated  this  program.  In  1992  NASA 
published  the  report  TOPS:  Toward  Other  Plane¬ 
tary  Systems.  During  the  past  decade.  Professor 
Burke  published  several  papers  that  treated  the 
possibility  of  searching  for  exoplanets. 

Publications 

Burke,  B.F.  "Concluding  Remarks:  Prospects  for 
the  Study  of  Planetary  Radio  Emission."  Third 
International  Workshop  on  Radio  Emissions 
from  Planetary  Magnetospheres,  Graz,  Austria, 
September  1991. 

Burke,  B.F.  "Searching  for  Exoplanets."  In  ESA 
Publications  for  the  Symposium  on  Targets  for 
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Space-Based  Interferometry,  Beaulieu,  France. 
October  1992. 

Meetings  and  Talks 

Burke,  B.F.  "Searching  for  Exoplanets."  University 
of  Massachusetts,  Amherst,  Massachusetts, 
April  1992,  and  University  of  Manchester, 
Manchester,  England,  November  1992. 

Burke.  B.F.  "The  Search  for  Other  Planetary 
Systems."  Talk  presented  at  the  Royal  Astro¬ 
nomical  Society,  London,  December  1992. 

Burke,  B.F.  "The  TOPS  Program."  California  Insti¬ 
tute  of  Technology  Summer  School,  Pasadena, 
California,  August  1992. 


1 .3  Studies  of  Gravitational  Lenses 

Sponsors 

National  Science  Foundation 

Presidential  Young  Investigator  Award 
David  and  Lucile  Packard  Fellowship 
for  Science  and  Engineering 

Project  Staff 

Professor  Jacqueline  N.  Hewitt,  Grace  H.  Chen, 
John  D.  Ellithorpe,  Matthew  K.  Gray,  Charles  A. 
Katz,  Pauline  McMahon,  Mark  D.  Messier, 
Christopher  B.  Moore.  Jie  Sun 

Gravitational  lenses  provide  radio  astronomers  with 
the  opportunity  to  reexamine  many  long-standing 
problems  in  astrophysics.  Two  of  these  problems, 
addressed  by  the  work  described  here,  are  the 
nature  and  distribution  of  dark  matter  in  galaxies 
and  the  calibration  of  the  age  and  scale  of  the  uni¬ 
verse  (i.e.,  determination  of  the  Hubble  constant). 
Both  applications  require  that  the  distribution  of 
gravitating  matter  be  inferred  from  the  observed 
imaging  properties  of  the  lens.  This,  in  turn, 
requires  detailed  measurements  of  the  brightness 
distribution  of  the  image.  Extensive  radio  maps  of 


two  gravitationally  lensed  extragalactic  radio 
sources,  MG1 131 +0456  and  MG041 4+0534,  are 
being  compiled  from  data  taken  with  a  Very  Large 
Array  radio  telescope.  Figure  1  shows  high- 
frequency  images  that  highlight  the  multiple  images 
of  the  active  galactic  nuclei  in  the  two  systems. 
The  configuration  of  these  images  provides  con¬ 
straints  on  models  of  the  mass  distribution  in  the 
lens;  other  information  is  derived  from  mapping  at 
lower  frequencies  where  other  structure  is  evident.^ 
There  are  differences  in  the  propagation  time,  of 
order  days  to  months,  for  the  different  ray  paths 
associated  with  the  images  of  the  active  galactic 
nucleus.  Monitoring  programs  aimed  at  measuring 
these  time  delays  are  being  carried  out  with  the 
Very  Large  Array  telescope.  Preliminary  plots  of  1 5 
GHz  flux  density  as  a  function  of  time  for  the  four 
images  of  MG041 4+0534  are  shown  in  figure  2. 
Simulations  of  time  delay  estimation  techniques 
indicate  that,  with  the  variability  displayed  by  the 
data  of  figure  2,  the  time  del.ays  in  this  system 
should  be  measurable. 


Figure  la.  22  GHz  radio  image  of  the  gravitational  lens 
MG1 131 +0456.  The  two  bright  sources  are  two  images 
of  the  same  background  galactic  nucleus. 


2  C.A.  Katz  and  J.N.  Hewitt.  Astrophys.  J.  Lett.,  forthcoming;  G.H.  Chen  and  J.N.  Hewitt,  "Multilrequency  Radio  Images  of 
MG1 1 31  +0456,"  in  Sub-Arcsecond  Radio  Astronomy,  ed.  R.J.  David  and  R.S.  Booth  (Manchester,  England:  CVP),  forthcoming. 
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Figure  1b.  15  GHz  radio  image  of  the  gravitational  lens 
MG0414-I-0534.  In  this  case,  there  are  four  images  of  the 
background  action  galactic  nucleus. 


function  of  time  for  the  four  components  of 
MG0414-t-0534.  All  four  components  display  variability  at 
a  level  not  uncommon  in  active  galactic  nuclei. 


1.4  Radio  Interferometry  of  Nearby  dMe 
Stars 

Sponsors 

National  Aeronautics  and  Space  Administration 
Grant  NAGW-2310 
David  and  Lucile  Packard  Fellowship 
for  Science  and  Engineering 

Project  Staff 

Professor  Jacqueline  N.  Hewitt,  John  D.  Ellithorpe, 
Christopher  B.  Moore 

dMe  stars  are  dwarf  M  stars  that  show  evidence  of 
surface  activity.  For  some  time  these  dwarf  stars 
have  been  known  to  flare  strongly  at  optical  and 
radio  wavelengths.  More  recently,  it  has  been  dem¬ 
onstrated  that  many  dMe  stars  exhibit  low-level 
quiescent  emission  that  is  detectable  in  Very  Long 
Baseline  Intererometry  (VLBI).  The  detection  of 
dMe  stars  on  VLBI  baselines  makes  possible  meas¬ 
urement  of  the  position  of  these  stars  with  high  pre¬ 
cision.  In  addition,  it  shows  that  astrometric 
detection  of  planetary  coi.. panions  may  be  feasible. 
The  astrometric  monitoring  program  has  continued 
with  third  epoch  measurements  of  dMe  stars. 


1.5  Algorithms  for  Advanced 
Microwave  Sounding  Unit  Operational 
Use 

Sponsors 

MIT  Lincoln  Laboratory 

SM  Systems  ana  Research  Corporation 

Project  Staff 

Professor  David  H.  Staelin,  Dr.  Philip  W. 
Rosenkranz 

This  project  provides  scientific  support  to  the 
National  Oceanic  and  Atmospheric  Administration 
(NOAA)  for  the  Advanced  Microwave  Sounding  Ur’  t 
(AMSU).  The  AMSU  is  scheduled  for  launch  on 
NOAA  polar-orbiting  weather  satellites  in  the 
mid-1990s  and  also  for  follow-on  instruments.  Our 
effort  emphasizes  development  of  line-by-line  atmo¬ 
spheric  transmittance  algorithms,^  estimation  of 


3  P.W.  Rosenkranz,  "Emission  and  Absorption  of  Microwaves  by  Atmospheric  Oxygen  and  Water  Vapor,"  paper  presented  at  the  Spe¬ 
cialist  Meeting  on  Microwave  Radiometry  and  Remote  Sensing  Applications.  Boulder,  Colorado,  January  14-16,  1992;  H.J.  Liebe, 
P.W.  Rosenkranz,  and  G.A.  Hufford,  "Atmospheric  60-GHz  Oxygen  Spectrum:  New  Laboratory  Measurements  and  Line  Para¬ 
meters,"  J.  Quant.  Spectrosc.  Radial  Transfer  ^6{5/6):  629-643  (1992). 


218  RLE  Progress  Report  Number  135 


Chapter  1 .  Radio  Astronomy 


surface  emissivity  and  precipitation,  issues  related 
to  instrument  design  and  specification,  and  general 
retrieval  methods. 

Measurements  from  the  MIT  aircraft-based  Micro- 
wrave  Temperature  Sounder  and  from  present  satel¬ 
lite  radiometers  such  as  the  Special  Sensor 
Microwave  Imager*  and  the  Microwave  Sounding 
Unit  are  used  to  develop  and  test  prototype 
parameter-retrieval  algorithms. 


1.6  Earth  Observing  System: 

Advanced  Microwave  Sounding  Unit 

Sponsor 

National  Aeronautics  and  Space  Administration/ 
Goddard  Space  Flight  Center 
Contract  NAS  5-30791 

Project  Staff 

Professor  David  H.  Staelin,  Dr.  Philip  W. 
Rosenkranz,  John  W.  Barrett,  Michael  J.  Schwartz, 
Carlos  R.  Cabrera 

The  Advanced  Microwave  Sounding  Unit  (AMSU) 
will  provide  microwave-band  measurements  to  the 
Atmospheric  Infrared  Sounder  (AIRS),  which  will  be 
a  facility  instrument  on  NASA’s  planned  Earth 
Observing  System  (EOS).  This  effort  is  directed 
toward  developing  algorithms  for  the  retrieval  of 
temperature  and  humidity  profiles,  precipita'ion,  sea 
ice,  land  snow  cover,  and  other  parameters.® 
According  to  our  present  plans,  the  temperature 
and  humidity  profiles  obtained  from  AMSU  will 
provide  initial  conditions  for  the  AIRS  infrared  cloud- 
correction  and  profile  retrieval  algorithms.  Current 
work  is  focused  on  simulation  efforts  and  on  a  rapid 
(as  distinguished  from  line-by-line)  microwave  trans¬ 
mittance  algorithm. 


1.7  High-Resolution  Passive  Microwave 
imaging  of  Atmospheric  Structure 

Sponsor 

Natio^’al  Aeronautics  and  Space  Administration/ 
Goddard  Space  Flight  Center 
Grant  NAG  5-10 

Project  Staff 

Professor  David  H.  Staelin,  John  W.  Barrett,  Dr. 
Philip  W.  Rosenkranz,  Michael  J.  Schwartz,  Paul 
W.  Fieguth 

The  MIT  Microwave  Temperature  Sounder  (MTS)  is 
a  dual-band  radiometer,  incorporating  an  imaging 
spectrometer  at  118  GHz  and  a  tunable  fixed-beam 
radiometer  at  52-54  GHz.  This  instrument  is  flown 
on  the  NASA  ER-2  high-altitude  aircraft.  In  Feb¬ 
ruary  and  March  1992,  we  participated  in  the 
STORM  Fronts  Experiment  Systems  Test 
(STORM-FEST),  which  was  a  national  field  exper¬ 
iment  to  provide  research  background  and  experi¬ 
ence  with  new  weather  observing  systems.  Several 
winter  storms  in  the  Kansas-Oklahoma  region  were 
overflov  n.  These  measurements  will  be  used  to 
study  the  response  of  microwave  radiometer 
measurements  to  precipitation  and  weather  fronts. 

In  May,  July,  and  August  1992,  the  MTS  was  flown 
along  with  the  Microwave  Imaging  Radiometer,  a 
NASA  instrument,  under  the  orbital  track  of  a  satel¬ 
lite  carrying  the  new  DMSP  SSMT-2  instrument,  a 
microwave  humidity  profiler.  These  flights  will  be 
used  to  validate  satellite  measurements. 

On  other  flights,  upward-observing  profiles  of  atmo¬ 
spheric  brightness  temperature  were  obtained. 
These  will  be  used  to  test  recently  developed  atmo¬ 
spheric  transmittance  algorithms. 


1.8  Rapid  Precision  Net-Form 
Manufacturing 

Sponsor 

MIT  Leaders  for  Manufacturing  Program 


■*  P.W.  Rosenkranz,  "SSM/I  Measurements  as  Predictors  of  the  Response  of  AMSU-A  to  Surface  and  Atmospheric  Phenomena,'  paper 
presented  at  the  First  NMC/NESDIS/DOD  Conference  on  DMSP  Retrieval  Products,  Washington,  DC,  /^pril  14-15,  1992;  P.W. 
Rosenkranz,  "Rough-Sea  Microwave  Emissivities  Measured  witn  the  SSM/I,"  IEEE  Trans.  Geosci.  Remote  Sens.  30(5):  1081-1085 
(1992). 

i  D.H.  Staelin,  "Passive  Microwa  i  Observations  of  Atmospheric  Parameters  from  Space,"  paper  presented  at  :he  Specialist  Meeting 
on  Microwave  Radiometry  and  Remote  Sensing  Applications,  Boulder,  Colorado,  January  14-16, 1992. 
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Project  Staff 

Professor  David  H.  Staelin,  Paul  W.  Fieguth 

Algorithms  were  improved  for  estimating  the  shape 
of  manufactured  objects  with  a  precision  of  microns 
at  very  high  data  rates.®  The  technique  is  to  esti¬ 
mate  in  optimum  fashion  the  distance  at  which 
minimum  defocusing  blur  is  encountered  at  each 
point  on  the  object:  multiple  images  are  viewed 
(typically  four  or  five)  at  different  ranges  while  the 
object  is  illuminated  with  a  fine  grid  of  illuminated 
points  of  light  (approximately  600  were  observed). 
The  experiment  involves  use  of  a  previously  devel¬ 
oped  four-axis  stage  with  a  simultaneously 
scannable  volume  of  -20  cubic  inches.  A  512^ 
pixel  CCD  camera  is  employed  with  an  adjustable 
field  of  view  down  to  0.5  mm^.  The  system  is  fully 
computer  controlled  and  can  receive  up  to  30 
frames  per  second. 

Rms  accuracies  of  50  nm  were  obtained  for  ideal 
mirror  surfaces  for  the  0.5-mm  field  and  a  stand-off 
distance  of  a  few  mm;  this  accuracy  degraded  to 
100  nm  when  faster  and  simpler  algorithms  were 
employed.  These  accuracies  were  higher  than 
those  expected,  partly  because  the  nature  of  the 
blur  function  is  strikingly  asymmetric,  as  can  be 
seen  from  thick  lens  ray  tracing  (but  not  thin  lens  or 
diffraction  optics).  For  rougher  materials — chalk, 
black  plastic,  and  copper— accuracies  of  1000-2000 
nm  were  obtained.  Chalk  and  plastic  are  difficult  to 
test  because  light  diffuses  inside  them  on  this 
spatial  scale;  copper  exhibits  strong  glint  from  its 
microcrystalline  surface.  The  system  scaling  laws 
derived  reveal  that  these  results  might  be  useful  in 
applications  to  much  larger  systems. 


1.9  Conformal  Experiment  Design 

Sponsor 

MIT  Leaders  for  Manufacturing  Program 

Project  Staff 

Professor  David  H.  Staelin,  Ashraf  S.  Alkhairy,  Paul 
W.  Fieguth 


Optimization  of  operating  parameters  for  products 
and  manufacturing  processes  are  often  determined 
by  experiment  in  cases  where  theoretical  analysis  is 
inadequate.  The  performance  of  these  products 
and  processes  is  often  limited  because  of  the  high 
cost  of  such  experiments:  experimentation  simply 
ceases  when  the  allocated  budget  or  time  ends. 
Using  most  conventional  techniques  for  experiment 
design,  such  as  those  of  Taguchi  and  Box,  on  the 
order  of  experiments  are  required,  where  N  is 
the  approximate  number  of  important  parameters. 
Using  newly  developed  techniques,  Dr.  Ashraf 
Alkhairy^  has  shown  that  in  many  cases  of  practical 
interest,  on  the  order  of  2N  experiments  are  suffi¬ 
cient  to  yield  excellent  results. 

The  coined  phrase  "conformal  experiment  design" 
suggests  the  way  in  which  these  new  techniques 
probe  the  complexity  of  the  product  or  process 
space  and  guide  definition  of  efficient  sets  of  exper¬ 
iments.  This  new  approach  conforms  to  the 
process  or  product  of  interest  by  taking  advantage 
of  any  intrinsic  simplicity  and  a  priori  knowledge 
and  by  matching  more  closely  the  chosen  quality 
cost  functions  and  statistical  models  to  the  problem 
at  hand.  This  involves  performing  a  series  of 
experiments  which  ends  when  those  experiments 
suggest  that  enough  or  all  unaccounted  second- 
order  interactions  are  negligible:  the  implication  is 
that  all  higher-order  interactions  are  then  probably 
negligible  too.  Other  methods  often  overlook  some 
of  these  interactions  even  with  experiments,  and 
thus  can  yield  inferior  results. 

The  impact  of  these  new,  more  efficient  methods 
for  experiment  design  can  be  substantial  when 
studying  processes  and  products  that  are  character¬ 
ized  by  so  many  significant  parameters  that 
experiments  are  not  practical.  Therefore,  exper¬ 
imental  optimization  can  be  employed  now  for 
larger  and  more  complex  systems  or  subsystems 
with  larger  values  of  N.  This  could  lead  to  signif¬ 
icantly  superior  performance  in  cases  where 
sequential  optimizations  of  subsets  of  N  parameters 
must  othenwise  be  performed. 

Initial  comparisons  of  these  methods  with  more  tra¬ 
ditional  Taguchi  and  response-surface  methods 


6  P.W,  Fieguth,  Range  Estimation  Accuracy  Using  Depth-from-focus  Methods — Theory  and  Experiment.  S.M.  thesis.  Dept,  of  Eiectr. 
Eng.  and  Comput.  Sci.,  MIT,  1992. 

7  A.S.  Alkhairy,  Optimal  Product  and  Manufacturing  Process  Selection — Issues  of  Formulation  and  Methods  for  Parameter  Design. 
Ph  D.  diss..  Dept,  of  Eiectr.  Eng.  and  Comput.  Sci.,  MIT,  1991.  Also  RLE  TR  No.  572. 
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(RSM)  by  J.E.  Glancy®  and  FJ.  MacDonald,  Jr.,® 
revealed  that  the  conformal  design  technique 
yielded  both  better  performance  and  process  char¬ 
acterization  and  required  fewer  experiments  for 
these  cases  (MacDonald  did  not  test  RSM).  Glancy 
obtained  a  68%  increase  in  mean  process  perfor¬ 
mance  for  a  heat  sealing  manufacturing  operation 
and  tripled  process  capacity  with  only  eight  con¬ 
formal  experiments.  After  16  experiments, 
MacDonald  reduced  marginal  manufacturing  costs 
for  a  fiber  drawing  process  by  30%  and  more  than 
doubled  the  capacity  of  the  existing  drawing  equip¬ 
ment  while  improving  quality  sufficiently  to  establish 
a  new  premium  product  line. 

Our  recent  work  has  been  directed  toward  genera¬ 
tion  of  a  simple  user-friendly  program  to  assist  engi¬ 
neers  with  both  experiment  design  and  analysis  of 
the  results.  This  program  accepts  a  list  of  control 
parameters  and  another  of  the  most  likely  nonlinear 
interactions  between  them.  The  program  then 
recommends  sets  of  8,  16,  32,  64  or  more  exper¬ 
iments,  as  selected  by  the  user.  These  experiment 
sets  are  orthogonal  arrays  designed  to  evaluate 
preferentially  those  interactions  of  greatest  concern. 
The  analysis  portion  of  the  initial  version  will  accept 
the  experiment  results,  expressed  as  a  user-chosen 
quality  metric  that  incorporates  errors  in  both  mean 
and  variance.  It  outputs  an  equation  relating  the 
input  parameters  to  the  quality  and  a  list  of  param¬ 
eter  values  that  should  yield  maximum  quality;  this 
recommendation  can  readily  be  checked  with  only 
one  or  a  few  more  experiments  using  these  values. 
Once  its  development  is  complete,  the  program  will 
be  tested  in  industry  and  then  improved  further. 


1.10  Recognition  of  Naturai  Neariy 
Repetitive  Signals 

Sponsor 

MIT  Leaders  for  Manufacturing  Program 

Project  Staff 

Professor  David  H.  Staelin,  Kevin  G.  Christian 

Dolphins  (and  some  machinery)  emit  nearly  repeti¬ 
tive  whistles  resembling  bird  song.  This  project  is 
exploring  methods  for  compressing  such  signals  so 
that  the  repetitious  elements  might  be  rapidly  identi¬ 
fied.  The  challenge  is  to  maximize  performance 
with  minimal  (perhaps  zero)  training  with  the 
system. 

This  year  a  broad  class  of  robust  measures  of 
whistle  parameters  was  applied  to  a  database  of 
approximately  1160  dolphin  whistles  collected  for 
22  animals.  We  used  contact  hydrophones  to 
reduce  the  effects  of  environmental  noise  and  prop¬ 
agation  variations.  Our  current  research  involves 
developing  methods  for  compact  representation  of 
the  whistles  and  their  efficient  recall  from  a  com¬ 
puter  database.  Preliminary  search  algorithms  for  a 
16-dimension  space  require  searching  roughly  five 
percent  of  the  total  data  base,  and  sometimes 
more,  to  find  the  best  Euclidian  match.  The  reduc¬ 
tions  in  search  time  obtained  by  less  perfect 
searches  are  being  explored.  These  same  algo¬ 
rithms  should  also  be  useful  for  diagnosing  acoustic 
and  other  signals  produced  by  machinery,  manufac¬ 
turing  processes,  and  other  environments. 

Studies  of  these  dolphin  whistles  using  these  same 
whistle  metrics  have  revealed  a  dolphin  has  much 
better  control  over  frequency  when  reproducing  its 
own  signature  whistles  than  when  copying  that  of 
another  dolphin. 


8  J.E.  Glancy  Jr.,  Manufacturing  Cost  Modeling  and  Robust  Process  Design,  S.M.  thesis,  Dept,  of  Mech.  Eng.  and  Dept,  of  Manage., 
MIT,  1992. 

9  F.J.  MacDonald,  Jr.,  An  Integrative  Approach  to  Process  Parameter  Selection  in  Fused-Silica  Capillary  Manufacturing,  S.M.  thesis. 
Dept,  of  Electr.  Eng.  and  Comput.  Sci.  and  Dept,  of  Manage.,  MIT,  1992. 
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1.1  Custom  Integrated  Circuits 

Sponsors 

Analog  Devices,  Inc. 

IBM  Corporation 

Project  Staff 

Professor  Jonathan  Allen,  Robert  C.  Armstrong, 
Donald  G.  Baltus,  Lisa  A.  Pickelsimer,  Mark  W. 
Reichelt,  Chin  Hwee  Tan,  Filip  J.  Van  Aelten 

The  overall  goal  of  VLSI  CAD  research  is  to  provide 
the  means  to  produce  custom  integrated  circuits 
quickly,  correctly,  and  economically.  Traditionally, 
correctness  has  been  verified  at  several  represent¬ 
ational  levels  of  abstraction,  such  as  layout  (via 
design  rule  checking),  and  circuit  and  logic  repre¬ 
sentations  (both  via  simulation).  These  techniques 
for  checking  correctness  are  usually  local  to  the 
particular  representational  level  involved  and,  while 
these  techniques  are  important  components  of  the 
overall  design  testing  procedure,  they  do  not 
attempt  to  provide  for  the  alignment  and  consist¬ 
ency  checks  between  the  different  abstract  repre¬ 
sentational  levels  and  an  input  behavioral 


specification.  In  addition,  they  do  not  characterize 
the  set  of  possible  designs  at  each  representational 
level  corresponding  to  the  initial  functional  specifica¬ 
tion  in  a  way  that  ranges  over  the  wide  variety  of 
possible  performance  levels.  For  this  reason,  there 
is  an  increasing  need  to  provide  CAD  tools  that 
serve  as  a  framework  for  design  exploration,  thus 
providing  the  desired  performance  together  with 
consistently  aligned  representations  at  all  levels. 

This  research  group  studies  a  variety  of  topics  with 
an  emphasis  on  performance-directed  synthesis  of 
custom  VLSI  designs.  An  overview  of  the  viewpoint 
that  motivates  these  projects  has  recently  been  pro¬ 
vided  in  a  major  survey  paper*  in  which  the  need 
for  coordinating  the  design  optimization  process 
over  the  several  levels  of  representation  is  empha¬ 
sized.  Since  design  exploration  is  so  central  to  the 
production  of  high-performance  designs,  emphasis 
is  placed  on  how  performance  can  be  characterized 
at  the  several  levels  of  representation  and  how 
overall  optimal  performance  can  be  achieved  in  an 
integrated  way. 

In  addition  to  the  basic  issues  of  circuit  optimiza¬ 
tion,  architectures  for  digital  signal  processing  have 
been  studied  because  of  the  highly  parallel  nature 


''  Computer  Science  Department,  General  Motors  Research  Laboratory,  Warren,  Michigan. 

2  J.  Allen,  "Performance-Directed  Synthesis  of  VLSI  Systems,”  Proc.  IEEE  78(2):  336-355  (1990). 


Chapter  1 .  Custom  Integrated  Circuits 


of  the  algorithms  involved  and  the  need  for  a  very 
high  level  of  real-time  performance  in  these 
systems.  Emphasis  on  developing  formally  speci¬ 
fied  systems  has  increased  so  that  the  design 
space  can  be  comprehensively  searched,  and  verifi¬ 
cation  of  the  resultant  design  can  be  confirmed  in 
terms  of  the  initial  behavioral  specification.  Current 
projects  focus  on  design  methodologies  for  very 
high-speed  clocked  circuits,  the  trade-offs  between 
power  dissipation  and  circuit  delay  with  transistor 
sizing  as  a  parameter,  design  of  highly  optimized 
array  architectures  for  digital  signal  processing,  and 
provision  of  a  VLSI  design  database  that  provides 
for  incremental  consistency  maintenance. 

There  has  been  a  continuing  evolution  of  circuit 
styles  to  provide  high  performance  in  a  minimal 
space.  For  example,  in  CMOS  technology,  static 
designs  are  utilized,  but  precharge-evaluate  meth¬ 
odologies  have  also  been  employed  to  provide 
speed,  minimize  the  number  of  transistors,  and 
increase  utilization  of  n-channel  devices  (rather 
than  slower  p-channel  devices).  In  recent  years, 
examples  of  a  new  circuit  style  called  True  Single- 
Phase  Clocking  (TSPC)  have  been  published.^  This 
style  provides  for  clock  speeds  of  several  hundred 
megahertz  in  modest  CMOS  technologies.  While 
these  results  have  been  confirmed,  an  analysis  that 
explains  the  speed  has  not  been  provided,  and  no 
overall  design  methodology  for  these  circuits  has 
been  demonstrated.  In  her  master's  thesis 
research  project,  Pickelsimer"  has  analyzed  the  per¬ 
formance  of  TSPC  circuits  as  well  as  other 
precharge-evaluate  forms,  in  the  context  of  a  ripple- 
carry  binary  adder.  A  library  of  basic  TSPC  circuit 
elements  has  been  provided,  as  well  as  a  compre¬ 
hensive  design  methodology  for  the  optimal  struc¬ 
turing  of  TSPC  and  TSPC-related  circuits  for 
minimal  delay.  In  this  way,  it  is  possible  to  system¬ 
atically  derive  an  optimal  circuit  form  from  an  input 
logical  specification  through  the  appropriate  combi¬ 
nation  of  matched  circuit  forms.  All  of  these  circuits 
require  only  a  single-phase  clock,  without  the  need 
for  clock  waveform  inversion,  and  hence  avoid  the 
problems  of  clock  skew  and  waveform  overlap. 


It  has  frequently  been  observed  that  sizing  of  tran¬ 
sistors  (i.e.,  variation  of  MOSFET  channel  widths) 
can  significantly  improve  the  speed  of  many  circuit 
forms.  In  particular,  optimal  procedures  have  been 
provided  to  examine  the  trade-off  of  circuit  delay 
with  aggregate  transistor  channel  width.  Even  more 
recently,  techniques  have  been  devised  to  probabi¬ 
listically  characterize  switching  activity  on  circuit 
nodes  in  terms  of  the  logic  function  being  per¬ 
formed  and  hence  deriving  power  estimates  for  the 
circuit,  which  can  be  modified  through  logical  trans¬ 
formations.®  Tan,  in  her  master's  thesis  work,  is 
studying  the  trade-off  between  circuit  delay  and 
power  dissipation  (as  characterized  by  probabilistic 
power  estimates)  with  transistor  channel  width  size 
as  a  parameter.  The  results  of  this  study  will  be 
useful  in  maximizing  the  performance  of  low-power 
circuitry  or  minimizing  the  power  of  circuits 
designed  to  meet  a  given  delay  specification. 

In  a  doctoral  dissertation  project  by  Baltus,  compre¬ 
hensive  optimization  of  array  logic  designs  is  being 
achieved  over  all  design  representations  from  the 
input  function  specification  to  the  final  layout.  The 
goal  is  to  generate  a  custom  layout  specification 
from  an  input  functional  description  using  a  lan¬ 
guage  called  FLASH,  which  is  well  suited  to  func¬ 
tionally  characterizing  signal  processing  algorithms, 
and  can  be  viewed  as  an  extension  of  the  SILAGE 
language.®  FLASH  designs  are  converted  to  data 
dependence  graphs,  where  some  elements  of  these 
graphs  are  reused  and  indexed  to  provide  a 
compact  representation  that  does  not  grow  with  the 
size  of  the  array.  Since  algorithms  are  mapped 
onto  a  class  of  architectures  with  well-defined  struc¬ 
tural  and  interconnection  characteristics,  important 
implementation-level  costs  can  be  accurately 
modeled  at  the  architectural  level.  More  specif¬ 
ically,  the  structured  nature  of  the  target  architec¬ 
ture  allows  the  temporal  and  spatial  relationships 
between  computations  to  be  seen  at  the  architec¬ 
tural  level. 

The  availability  of  an  abstract  model  which  accu¬ 
rately  predicts  implementation-level  characteristics 
allows  the  process  of  performance-directed  syn¬ 
thesis  to  be  formalized.  Powerful  techniques  from 


3  J.  Yuan  and  E.  Svensson,  "High-Speed  Circuit  Technique,"  IEEE  J.  Solid-State  Circ.  24(1):  62-70  (1989). 

<  L.A.  Pickelsimer,  A  Structured  Design  Methodology  for  Speed-Optimized  True  Single-Phase-Clock  Dynamic  CMOS.  S  M.  thesis, 
Dept,  of  Electr.  Eng.  and  Comput.  Sci.,  MIT,  1992. 

5  A.  Shen,  A.  Ghosh,  S.  Devadas,  and  K.  Keutzer,  "On  Average  Power  Dissipation  and  Random  Pattern  Testability  of  CMOS  Combi¬ 
national  Logic  Networks,"  Digest  of  Technical  Papers,  lEEE/ACM  International  Conference  on  Computer-Aided  Design,  Santa  Clara, 
California,  November  1992,  pp.  402-407. 

6  P.N.  Hilfinger,  "A  High-Level  Language  and  Silicon  Compiler  for  Digital  Signal  Processing,"  Proceedings  of  the  IEEE  1985  Custom 
Integrated  Circuits  Conference,  Portland,  Oregon,  May  1985,  pp.  213-216. 
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graph  theory,  optimization  theory,  and  linear 
algebra  can  be  used  to  allow  the  space  of  possible 
implementation  architectures  to  be  efficiently,  com¬ 
prehensively,  and  systematically  explored.  A 
systolic  array  synthesis  method  has  been  devel¬ 
oped  which  exploits  these  techniques.  The  space 
of  possible  implementation  architectures  is  effi¬ 
ciently  explored,  and  a  set  of  provably  optimal 
register-transfer-level  implementations  is  generated. 
The  exhaustive  nature  of  the  design  exploration 
process  ensures  that  all  optimal  designs  are  gener¬ 
ated.  Compared  to  other  work  in  this  area,  the 
developed  technique  is  applicable  to  a  wider  class 
of  algorithm  and  is  the  only  system  of  its  kind  which 
incorporates  exhaustive  architectural  exploration. 

Armstrong  has  built  a  multirepresentation  VLSI 
database  that  provides  for  incremental  consistency 
maintenance.^  The  goal  of  this  system  is  to  auto¬ 
matically,  incrementally,  and  continuously  provide 
for  consistent  alignment  of  all  levels  of  design 
abstraction.  The  system  provides  a  design  environ¬ 
ment  which  maintains  these  consistency  linkages 
while  allowing  the  designer  to  initiate  changes  at 
any  level  of  representation.  Alignment  is  achieved 
through  the  introduction  of  explicit  correspondence 
zones  between  the  adjacent  representation  levels. 
Some  of  the  necessary  translation  rules  to  maintain 
correspondence  are  built  into  the  system,  but  there 
is  also  provision  for  the  user  to  introduce  new 
relationships  between  levels  of  design  represen¬ 
tation.  The  initial  implementation  of  this  system  has 
required  over  60,000  lines  of  code,  written  in  C,  but 
database  schemas  that  are  written  in  C++  will  now 
be  introduced  to  vastly  compress  the  size  of  the 
code,  increase  the  speed  of  the  database,  and  min¬ 
imize  errors  and  the  need  for  debugging.  In  addi¬ 
tion.  the  ability  to  explore  designs  at  a  given  level  of 
representation  is  being  coupled  to  the  basic  data¬ 
base  structure.  A  circuit  simulator  is  being  con¬ 
nected  at  the  circuit-level  of  representation,  and  a 
transistor-sizing  algorithm  is  also  being  used  at  that 
level.  Since  there  will  always  be  design  require¬ 
ments  that  have  not  been  contemplated  in  the  exis¬ 
ting  design,  a  default  capability  is  being  provided 
that  will  indicate  structural  regions  that  must  be 
changed  in  a  design  in  response  to  a  user-initiated 
change  at  another  level  without  automatically  pro¬ 
viding  the  translation  rule  that  maintains  corre¬ 
spondence.  In  this  way,  users  will  not  be  restricted 
to  a  system-provided  set  of  correspondences,  but 
may  continuously  introduce  new  translations  that 


may  eventually  be  assimilated  into  the  overall 
system. 

There  is  increasing  interest  in  overall  CAD  systems 
where  the  database  is  central  and  surrounded  by 
specific  applications  for  design  and  verification  at  a 
variety  of  different  levels  of  representation,  such  as 
in  the  database  just  described.  This  research 
group  is  currently  using  a  network  of  workstations 
that  provides  an  implicit  client-server  architecture 
together  with  coupling  to  a  mainframe  and  its 
attached  large  tape  and  disk  capabilities  intercon¬ 
nected  with  high-speed  fiberoptic  switching. 
Scalable,  parallel  architectures,  particularly  useful 
for  highly  parallel  simulation  codes,  are  now  being 
studied  for  introduction  into  this  framework.  They 
will  also  support  multiple  client  utilization  of  the 
database  with  extremely  fast  communication 
between  the  contributing  processors.  This  system 
will  serve  as  a  test  bed  for  the  combined  utilization 
of  high-performance  workstations  and  mainframe- 
based  storage  for  high-speed,  reliable  database 
access.  This  environment  also  provides  a  structure 
in  which  distributed  parallel  algorithms  can  run 
coherently  within  an  overall  environment  that  sup¬ 
ports  other  specialized,  nonparallel  CAD  algorithms. 

1.2  Analog  VLSI  Systems  for  Integrated 
Image  Acquisition  and  Early  Vision 
Processing 

Sponsor 

National  Science  Foundation 
Grant  MIP  91-17724 

Project  Staff 

Professor  John  L.  Wyatt,  Jr.,  Professor  Berthold 
K.P.  Horn,  Professor  Hae-Seung  Lee,  Dr.  Ichiro 
Masaki,  Professor  Charles  G.  Sodini,  Professor 
Jacob  K.  White 


1.2.1  Project  Summary 

In  real-time  machine  vision  the  sheer  volume  of 
image  data  to  be  acquired,  managed  and  pro¬ 
cessed  leads  to  communications  bottlenecks 
between  imagers,  memory,  and  processors,  and 
also  to  very  high  computational  demands.  We  are 
designing  and  testing  experimental  analog  VLSI 


’  R  C  Armstrong  and  J.  Allen,  "FICOM:  A  Framework  for  Incremental  Consistency  Maintenance  in  Multi- Representation,  Structural 
VLSI  Design  Databases."  Digest  ol  Technical  Papers,  lEEE/ACM  International  Conference  on  Computer-Aided  Design,  Santa  Clara, 
California,  November  1992,  pp  336-343. 
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systems  to  overcome  these  problems.  The  goal  is 
to  determine  how  the  advantages  of  analog 
VLSI — high  speed,  low  power  and  small  area — can 
be  exploited,  and  its  disadvantages— limited  accu¬ 
racy,  inflexibility  and  lack  of  storage  capacity — can 
be  minimized.  The  work  is  concentrated  on  early 
vision  tasks,  i.e.,  tasks  that  occur  early  in  the  signal 
flow  path  of  animal  or  machine. 

The  next  section  details  substantial  progress  on 
eleven  different  designs,  including  systems  for 
camera  motion  estimation,  edge  detection,  auto¬ 
matic  alignment  to  calibration  marks,  brightness 
adaptive  imaging,  and  other  tasks.  Perhaps  the 
most  exciting  news  concerns  the  stereo  vision 
system,  the  initial  portion  of  which  was  completed 
by  J.  Mikko  Hakkarainen  in  his  doctoral  work  with 
Professor  Hae-Seung  Lee  last  May.  This  project 
has  attracted  the  attention  of  General  Motors,  which 
sent  a  vision  systems  engineer.  Dr.  Ichiro  Masaki, 
to  Boston  for  a  year  to  work  with  us  and  investigate 
the  possibilities  of  this  type  of  design  for  future  use 
in  automobiles.  He  has  made  a  real  contribution  to 
this  project  in  directing  our  attention  to  practical 
automotive  applications. 


I. 2.2  Vision  Chip  Designs 

Real-Time  Stereo  Vision  System  in 
CCD/CMOS  Technology 

J.  Mikko  Hakkarainen  completed,  for  his  doctoral 
thesis  under  Professor  Hae-Seung  Lee,  the  design 
of  an  analog  VLSI  design  for  a  high-speed 
binocular  stereo  vision  system  used  for  the 
recovery  of  scene  depth.  In  this  context  he  has 
attempted  to  exploit  the  principal  advantages  of 
analog  VLSI— small  area,  high  speed,  and  low 
power— while  minimizing  the  effects  of  its  traditional 
disadvantages — limited  accuracy,  inflexibility,  and 
lack  of  storage  capacity.  A  CCD/CMOS  stereo 
system  implementation  was  proposed,  capable  of 
processing  several  thousand  image  frame  pairs  per 
second  for  40x40  pixel  binocular  images,  which 
should  yield  approximately  frame  rate  operation  on 
256x256  pixel  image  pairs.  A  shift-and-compare 
module  of  the  stereo  system  has  been  fabricated 
and  characterized. 

The  algorithmic  focus  of  this  research  was  on 
binocular  stereopsis  and  the  stereo  correspondence 
problem  in  particular.  Three  stereo  algorithms 
(Marr-Poggio-Drumheller,  Pollard-Mayhew-Frisby, 
and  Prazdny)  were  considered  in  detail  as  solutions 
to  this  problem  and  as  candidates  for  VLSI  imple¬ 
mentation.  The  most  crucial  step  in  the  recovery  of 
stereo  correspondence  was  found  to  be  the  corre¬ 
lation  of  left  and  right  images  through  candidate 


matches  and  local  support  scores.  The  Marr- 
Poggio-Drumheller  (MPD)  algorithm  was  shown  to 
be  best  suited  for  an  efficient  VLSI  implementation. 
A  detailed  simulation  study  of  this  algorithm  was 
presented.  The  simulations  were  specifically 
designed  to  address  issues  related  to  analog  VLSI 
implementation. 

A  full  MPD  stereo  system  consists  of  seven 
modules:  two  imagers  (left  and  right  views),  two 
image  filters,  a  shift-and-compare  module 
("correlator"),  a  local  support  filter,  and  a  vote  taker. 
The  shift-and-compare  module  together  with  the 
local  support  filter  perform  the  largest  number  of 
operations  per  pixel  in  the  stereo  system,  and  thus 
they  present  the  speed  bottleneck.  Of  the  two 
modules,  the  shift-and-compare  function  is  more 
demanding  for  hardware  implementation,  partic¬ 
ularly  because  the  local  support  filter  has  a  simple 
low-pass  characteristic  that  has  already  been  dem¬ 
onstrated  previously  using  the  CCD/CMOS 
approach.  For  these  reasons  the  shift-and-compare 
circuit  was  selected  for  design  and  fabrication.  A 
40x40  pixel  absolute-value-of-difference  (AVD) 
array,  the  core  processor  of  the  shift-and-compare 
function,  was  fabricated  through  MOSIS  in  a  2  urn 
CCD/CMOS  process.  The  array  can  perform  the 
AVD  operation  between  two  arrays  a  programmable 
number  of  times.  At  a  10  MHz  clock  rate  for  the 
CCD  processor,  approximately  1000  array  (image 
frame)  pairs  can  be  processed  per  second  for  a 
total  disparity  range  of  11  pixels.  Special  purpose 
test  circuits  were  designed  that  are  compatible  with 
such  high  speed  processing  requirements.  Indi¬ 
vidual  AVD  processors  as  well  as  the  full  array 
were  characterized.  Array  functionality  was  demon¬ 
strated  in  the  stereo  algorithm  context  by  running 
the  MPD  stereo  system  with  actual  data  from  the 
AVD  chip.  Upon  comparison,  the  chip  output  was 
seen  to  correspond  closely  with  fully  computerized 
stereo  simulations  for  both  artificial  and  real  input 
image  pairs. 

A  Dynamic-Wires  Chip  for  Pixel  Clustering 

Dr.  John  G.  Harris,  working  in  collaboration  with 
Shih-Chii  Liu  from  the  Rockwell  Science  Center, 
has  continued  to  develop  the  dynamic  wires 
paradigm  for  solving  a  new  class  of  computer  vision 
problems  with  analog  hardware.  This  methodology 
provides  dedicated  lines  of  communication  among 
groups  of  pixels  of  an  image  which  share  common 
properties.  In  simple  applications,  object  regions 
can  be  grouped  together  to  compute  the  area  or 
the  center  of  mass  of  each  object.  Alternatively, 
object  boundaries  may  be  used  to  compute  curva¬ 
ture  or  contour  length.  These  measurements  are 
useful  for  higher  level  tasks  such  as  object  recogni¬ 
tion  or  structural  saliency.  The  dynamic  wire  meth- 
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odology  is  efficiently  implemented  in  fast,  low-power 
analog  hardware.  Switches  create  a  true  electrical 
connection  among  selected  pixels — dynamically 
configuring  wires  or  resistive  networks  on  the  fly. 
Dynamic  wires  provide  a  different  processing 
approach  from  that  of  present  vision  chips,  which 
are  limited  to  pixel-based  or  image-based  oper¬ 
ations.  Using  this  methodology,  they  have  success¬ 
fully  designed  and  demonstrated  a  custom  analog 
VLSI  chip  which  computes  contour  length. 

Single-Chip  CMOS  Imager  and  Smoothing 
and  Segn'  ntation  System  Using  Resistive 
Fuses 

Paul  Yu,  working  with  Professor  Hae-Seung  Lee, 
has  designed,  built,  and  tested  an  integrated  CMOS 
resistive-fuse  processor  capable  of  smoothing  out 
noise  while  preserving  the  edges  of  a  32x32  image. 
The  on-chip  photo-transistor  imager  converts  the 
optica!  image  into  electrical  currents.  Through  a 
system-level  optimization,  a  single-pixel  processing 
element  using  only  15  transistors  for  the  horizontal 
resistive  fuses  and  only  one  transistor  for  the  ver¬ 
tical  resistor  was  realized.  The  processed  output  is 
read  out  using  a  row  decoder  and  a  column  MUX. 
Since  the  on-resistance  and  off-voltage  of  the 
resistive  fuses  can  be  electronically  controlled,  one 
can  vary  the  degree  to  which  the  image  is 
smoothed  and  segmented  electronically.  The  chip 
was  fabricated  through  MOSIS  and  tested;  it  works 
well.  The  chip  contains  over  19,000  transistors  and 
occupies  a  9.2  mm  x  7.9  mm  area.  Operating  from 
a  single  5-V  supply,  it  typically  dissipates  only  10 
mW. 

Novel  Operational  Amplifier  for  Video  Rate 
Analog-to-Digital  Conversion 

Paul  Yu  is  working  with  Professor  Hae-Seung  Lee 
on  novel  operational  amplifiers  to  serve  as  key 
components  in  a  video-rate  pipelined  analog-to- 
digital  converter.  A  new  gain-enhancement  tech¬ 
nique,  using  a  replica  amplifier,  is  being  studied. 
Although  this  technique  is  general  and  can  be 
applied  to  any  conventional  op-amp  topologies  such 
as  folded-cascode,  class-AB,  differential,  or  single- 
ended,  a  two-stage  topology  without  any  cascade 
was  chosen  to  maximize  the  available  output  swing 
and  input  common-mode  range.  Experimental 
results  show  that  the  gain  enhancement  technique 
improves  the  open-loop  gain  by  a  factor  of  13 
without  adversely  affecting  the  output  swing  or  the 
settling  behavior.  The  amplifier  achieves  an  effec¬ 
tive  open-loop  gain  of  10,800  using  only  minimum- 
length  transistors  in  the  signal  path  and  no 
cascade.  Even  with  a  1  K  ohm  load  resistor,  an 


effective  open-loop  gain  of  over  10,000  is  main¬ 
tained.  The  output  swings  within  100  mV  from 
either  rail.  The  gain-bandwidth  product  is  63  MHz 
with  a  27  pF  load  capacitance  and  9  mW  power 
dissipation.  These  results  are  obtained  using  only 
a  2-V  supply.  The  chip  was  fabricated  in  a  1 .2um 
CMOS  technology. 

CMOS/CCD  Imager  and  Focus-of-Expansion 
Chip  for  Camera  Motion 

Ignacio  McQuirk,  working  with  Professors  Berthold 
Horn  and  Hae-Seung  Lee,  has  continued  develop¬ 
ment  of  a  chip  that  determines  the  direction  of 
camera  translation  directly  from  a  time-varying 
image  sequence  with  a  real-time  analog  VLSI 
system.  His  approach  assumes  a  camera  moving 
through  a  fixed  world  with  only  translational  velocity. 
Also,  the  surfaces  in  the  environment  must  have 
some  spatial  variations  in  reflectance.  The  focus  of 
expansion  (FOE)  is  the  projection  of  the  camera 
translation  vector  onto  the  image  plane  and  hence 
gives  the  direction  of  camera  motion.  The  FOE  is 
the  image  of  the  point  towards  which  the  camera  is 
moving  and  the  point  from  which  other  image  points 
appear  to  be  receding.  Knowledge  of  the  location 
of  the  FOE  in  the  case  of  pure  translation  also 
allows  coarse  calculation  of  the  depth  map  of  the 
imaged  world  up  to  a  scale  factor  ambiguity,  as  well 
as  the  associated  time  to  impact. 

Previously,  various  possible  algorithms  for  esti¬ 
mating  the  FOE  were  compared  in  terms  of  accu¬ 
racy,  robustness,  and  feasibility  for  single-chip 
analog  VLSI  implementation  with  on-chip  imaging 
circuitry.  As  a  result  of  these  studies,  an  algorithm 
based  on  minimizing  the  sum  of  squares  of  the  dif¬ 
ferences  at  stationary  points  between  the  observed 
time  variation  of  brightness  and  the  predicted  vari¬ 
ation  was  chosen  for  implementation.  Additionally, 
several  system  architectures  and  circuit  designs 
were  explored,  and  one  was  chosen  for  realization 
in  analog  VLSI.  Mr.  McQuirk  designed  a  prelimi¬ 
nary  CMOS/CCD  version  of  the  FOE  chip  using 
on-chip  CCD  imaging  circu.'.ry  and  a  row-parallel 
processing  scheme.  Several  components  of  this 
design  were  fabricated  through  MOSIS  and  tested 
with  excellent  results. 

During  the  period  covered  by  this  report,  the  design 
of  the  FOE  chip  was  revised,  and  a  preliminary 
version  of  the  complete  chip  was  also  fabricated 
through  MOSIS.  He  is  currently  in  the  process  of 
designing  and  fabricating  a  test  printed  circuit  board 
for  the  FOE  system  consisting  of  pattern  genera¬ 
tors,  CCD  clock  drivers,  FOE  chip  bias  circuitry,  I/O 
drivers,  and  an  interface  to  the  Vision  Chip  Test 
System.  This  will  allow  for  stand-alone  operation  of 
the  FOE  system  as  well  as  operation  with  the 
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Vision  Chip  Test  System  for  algorithmic,  system, 
and  circuit  level  testing. 

Robust  Analog  Image  Segmentation  Chip 
for  Edge  Detection  and  Camera  Motion 

Lisa  Dron,  working  with  Professor  Horn,  has 
invented  a  robust  method  of  image  segmentation 
that  has  been  shown  to  work  well  in  computer  sim¬ 
ulations.  The  basic  idea  is  to  segment  an  image 
between  two  nodes  of  a  grid  wherever  the  differ¬ 
ence  in  intensities  at  the  two  nodes  exceeds  a 
threshold  at  all  of  several  different  scales  of  spatial 
low-pass  filtering.  (The  threshold  can  vary  with  the 
scale.)  The  method  is  robust  in  the  sense  that  local 
noise  spikes  in  the  image  are  averaged  out  with  fil¬ 
tering  and  do  not  contribute  false  edges,  and  it  is 
accurate  in  the  sense  that  the  unfiltered  image 
plays  a  role,  since  threshold  must  be  exceeded 
there  as  well.  Bandwidth  compression  is  achieved 
by  retaining  intensity  values  only  at  nodes  adjacent 
to  edges.  Intensities  elsewhere  are  obtained  by 
spatial  averaging  using  a  resistive  grid. 

A  preliminary  design  has  been  studied  for  the  seg¬ 
mentation  circuit  using  a  two-dimensional  CCD 
array  with  added  circuitry  for  computing  differences 
and  performing  threshold  tests.  The  structure  of  the 
array  uses  the  same  principles  developed  by  Dr. 
Craig  Keast  for  smoothing  and  segmentation; 
however,  it  has  been  modified  to  implement  the 
multi-scale  veto  (MSV)  algorithm.  In  addition, 
several  improvements  suggested  by  Dr.  Keast  to 
improve  the  performance  of  the  CCD  circuits  have 
been  incorporated.  A  test  chip  containing  a  one¬ 
dimensional  array,  as  well  as  its  isolated  subcir¬ 
cuits,  has  been  fabricated  through  MOSIS  and 
tested.  A  second  chip  with  a  two-dimensional  array 
has  now  been  designed  and  fabricated  and  is  cur¬ 
rently  being  tested. 

The  two  primary  applications  of  the  multi-scale  veto 
(MSV)  chip  are  image  reconstruction  and  edge 
detection.  Ms.  Dron  has  chosen  to  pursue  the 
second  application  in  order  to  design  a  system  for 
computing  general  camera  motion  in  real-time  with 
analog  processing.  The  binary  edge  signals  pro¬ 
duced  by  the  MSV  chip  can  be  used  to  obtain  cor¬ 
respondences  between  successive  frames  by 
matching  patches  of  the  edge  maps  in  the  two 
images.  It  is  well  known  that  it  is  possible  to 
compute  the  relative  camera  motion  between 
frames  from  as  few  as  five  correspondences,  pro¬ 
vided  the  internal  camera  parameters  are  known; 
although  in  practice,  several  more  than  five  are 
used  in  order  to  obtain  a  more  reliable  estimate  of 
the  motion. 


Conventional  digital  methods  for  estimating  camera 
motion  typically  utilize  special  purpose  DSP  chips  to 
find  point  correspondences  by  matching  gray  levels 
over  small  patches  in  the  two  images  using  either 
cross-correlation  or  mean  absolute  value  of  differ¬ 
ence.  Due  to  the  complexity  of  the  computations 
with  8-bit  gray-level  data,  the  maximum  displace¬ 
ment  which  can  be  computed  at  vides  rate  with  a 
single  chip  is  currently  limited  on  commercially 
available  chips  to  around  ±8  pixels  in  any  direction. 
In  addition,  the  displacement  estimates  obtained 
from  matching  gray  levels  are  in  general  less  reli¬ 
able  than  those  which  can  be  obtained  by  matching 
edges.  It  is  possible  to  define  functions  on  two 
patches  of  binary  edges  which  have  a  large  non¬ 
zero  value  only  when  there  is  a  good  match 
between  their  edge  patterns.  Robust  displacement 
estimates  are  obtained  by  identifying  patches  which 
produce  a  well-peaked  unimodal  response  when 
tested  against  all  like-sized  patches  within  a  large 
search  area  in  the  other  image.  An  analog 
processor  can  be  designed  to  effec  ively  implement 
such  a  matching  function  because  it  is  not  neces¬ 
sary  to  accurately  compute  the  exact  value  of  the 
function,  but  only  to  find  the  location  of  its  peak 
value  and  to  determine  if  it  is  significantly  greater 
than  zero. 

Ms.  Di  Ti  has  recently  completed  a  study  of  dif¬ 
ferent  matching  functions  which  can  be  computed 
by  analog  methods  using  the  binary  edge  signals 
produced  by  the  MSV  chip.  She  has  designed 
several  test  structures  which  will  eventually  form  the 
basis  of  the  analog  matching  circuit  and  has  sent 
these  out  for  fabrication  by  MOSIS.  In  addition  to 
designing  the  circuits,  she  has  also  developed  the 
algorithms  needed  to  use  the  point  correspond¬ 
ences  in  a  real-time  system.  She  has  derived  a 
robust  least-squares  method  suitable  for  implemen¬ 
tation  on  a  digital  processor  to  compute  camera 
motion,  as  well  as  an  algorithm  to  determine  the 
internal  camera  calibration  parameters,  required  for 
motion  computation,  from  point  matches  in  a 
sequence  of  images  for  which  the  translational 
motion  is  known. 

Content-Addressable  Parallel  Digital  Image 
Processor 

Massively  parallel  associative  processors  may  be 
well  suited  as  coprocessors  for  accelerating 
machine  vision  applications.  They  achieve  very  fine 
granularity,  as  every  word  of  memory  functions  as  a 
simple  processing  element.  A  dense,  dynamic, 
content-addressable  memory  cell  supports  fully- 
parallel  operation,  and  pitch-matched  word  logic 
improves  arithmetic  performance  with  minimal  area 
cost.  An  asynchronous  reconfigurable  mesh 
network  handles  interprocessor  communication  and 
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image  input/output,  and  an  area-efficient  pass- 
transistor  circuit  counts  and  prioritizes  responders. 

Frederick  P.  Herrmann  has  completed  the  design  of 
a  16K-trit  associative  processing  chip,  under  the 
supervision  of  Professor  Charles  Sodini.  The  chip 
integrates  256  processing  elements,  each  with  64 
tritr  of  memory.  A  16  x  16  mesh  network  connects 
the  processing  .  '•’ments,  and  larger  meshes  may 
be  constructed  using  several  chips. 

The  chip  has  been  submitted  for  fabrication  in  MIT's 
1.5  urn  CCD/CMOS  process.  Fabrication  is  com¬ 
plete,  and  testing  will  begin  in  January  1993. 

Single-Chip  Alignment  Sensor 

Christopher  Umminger,  working  with  Professors 
Sodini  and  Horn,  has  been  researching  implemen¬ 
tations  of  single-chip  sensors  to  measure  alignment 
error  for  tasks  in  automated  manufacturing.  The 
sensor  will  optically  acquire  an  image  of  the  align¬ 
ment  marking  and  output  a  measure  of  the  misa¬ 
lignment.  Currently  they  have  settled  on  two 
methods  for  the  task  of  aligning  a  mark  to  the 
sensor.  The  first  is  based  on  the  surveyor's  mark. 
The  sensor  consists  of  a  four  quadrant  photodiode 
combined  with  an  edge  detection  circuit  to  provide 
extra  alignment  accuracy  in  the  face  of  nonuniform 
illumination.  The  second  sensor  design  uses  a 
square  grating  mark.  A  position  sensitive  Moire 
fringe  pattern  formed  between  the  sensor  and  the 
marking  is  used  to  achieve  accurate  alignment. 

The  first  sensor  design  using  the  surveyor's  mark 
was  finished  in  June  and  submitted  for  fabrication. 
The  completed  chips  were  received  in  mid- 
September  and  preliminary  tests  show  that  the  cir¬ 
cuits  are  working  as  expected.  Characterization  of 
the  sensor  in  an  alignment  system  awaits  the  con¬ 
struction  cf  a  prototype  aligner.  Design  of  the 
Moire  fringe  sensor  will  begin  in  November  and  will 
be  submitted  for  faorication  at  the  end  of  the  year. 

Switched-Capacitor  System  for  Merging 
Dep*h  and  Slope  Estimates 

Mark  Seidel,  working  with  Professor  John  Wyatt 
and  Principal  Research  Scientist  Thomas  Knight, 
has  been  designing  and  fabricating  a  switched- 
capacitor  chip  to  robustly  compute  the  depth  of  a 
scene  by  merging  depth  estimates  (possibly  from  a 
stereo  algorithm)  with  surface  slope  estimates  (pos¬ 
sibly  from  a  shape-from-shading  algorithm).  This 
multi-sensor  image  fusion  design  uses  the  least- 
squares  properties  of  certain  switched-capacitor 
systems  to  compute  a  dense  depth  map  given 
noisy  or  sparse  depth  and  slope  input.  Testing  of  a 
1-D  version  fabricated  through  the  MOSIS  system 


revealed  a  shorting  problem  that  inhibits  further 
testing.  Laser  "surgery"  done  at  MIT  Lincoln  Labo¬ 
ratory  failed  to  resolve  the  origin  of  the  problem.  A 
MOSIS  Tinychip  containinr  some  subcircuits  has 
also  been  fabricated,  but  i.ui  tested.  The  architec¬ 
ture  of  the  chip  was  presented  at  ISCAS  1992  in 
San  Diego.  In  the  final  two  months  of  the  year,  the 
MOSIS  Tinychip  will  be  tested. 

Brightness  Adaptive  CCD  Imager  with 
Analog  and  Digital  Output 

Steven  Decker,  working  with  Professor  Sodini,  has 
been  designing  an  imager  to  act  as  the  front  end  of 
a  modular  vision  system.  It  will  feature  a  64x64 
array  of  pixels,  large  fill  factor,  operating  speed  on 
the  order  of  1000  frames/sec,  column  parallel 
ouput,  a  brightness  adaptation  capability,  and  user 
selectable  analog  or  lO-Nt  digital  output. 

A  CCD-based  algorithmic  A/D  converter  has  been 
developed  which  requires  only  CCD  gates  and  one 
operational  amplifier.  It  avoids  the  need  for  sub¬ 
traction  of  charge  packets  required  by  a  similar  pre¬ 
vious  design,  a  step  which  severely  limited  the 
attainable  accuracy.  Analog  output  can  also  be 
obtained  by  shifting  the  charge  onto  an  integrating 
capacitor  in  a  feedback  loop  around  an  operational 
amplifier.  The  linearity  of  the  analog  output  is  much 
better  than  for  a  conventional  CCD  output  because 
the  poly-poly  capacitor  in  the  feedback  loop  is  much 
more  linear  than  the  diffusion  capacitance  of  a  con¬ 
ventional  CCD  output. 

Th-^  brightness  adaptive  imaging  element  is  based 
on  a  design  that  was  previnu'ly  patented  else¬ 
where.  Each  pi,\„’  is  composed  of  an  MOS 
capacitor  biased  in  deep  depletion  and  an  overflow 
gate.  The  overflow  gate  establishes  a  barrier  to  the 
flow  of  imaged  charge  from  the  main  potential  well 
to  a  sink.  During  the  integration  period,  the  height 
of  the  barrier  (j)(t)  is  varied  as  a  function  of  time.  A 
relationship  between  the  light  intensity  and  final 
imaged  charge  can  be  derived  for  a  given  (j)(t).  In 
particular,  it  is  possible  to  obtain  a  logarithmic  com¬ 
pression  curve  so  that,  ideally,  any  light  intensity 
can  be  sensed  without  saturating  the  imager.  The 
advantages  of  this  technique  oyer  other  schemes 
which  have  been  considered  are  simplicity,  flexi¬ 
bility,  and  small  size. 

Efforts  have  been  directed  towards  designing  a 
practical  realization  of  the  brightness  adaptive 
imager.  In  the  original  patent,  (()(t)  is  assumed  to  be 
a  continuous,  infinitely  precise  function  of  time.  The 
implications  of  approximating  (^(\)  with  a  digital 
signal  have  been  examined.  Using  the  digital  gener¬ 
ation  scheme  and  digital  output,  reconstruction  of 
the  original  light  intensity  from  the  imaged  charge  is 
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easy  since  this  can  all  be  done  digitally.  A  layout 
scheme  which  allows  column  parallel  output  while 
maintaining  a  fill  factor  exceeding  50  percent  has 
been  achieved. 


1.2.3  Mixed  Circuit/Device  Simulation  for 
CMOS/CCD  Systems 

Andrew  Lumsdaine  and  Mark  Reichelt,  under  the 
supervision  of  Professor  Jacob  K.  White,  are 
working  on  algorithms  for  accelerating  mixed  cir¬ 
cuit/device  simulation,  as  this  kind  of  analysis  is 
required  to  properly  simulate  the  performance  of 
CCD-based  analog  vision  circuits.  Initially,  they 
have  focused  on  developing  parallelizable  algo¬ 
rithms  for  semiconductor  device  transient  simu¬ 
lation,  and  are  investigating  waveform  relaxation 
(WR),  a  natural  candidate.  They  found  exper¬ 
imental  evidence  that  accelerating  WR  convergence 
using  standard  overrelaxaticn  can  produce  oscilla¬ 
tory  results  and  have  recently  developed  methods 
for  eliminating  this  phenomenon.  In  particular,  they 
have  developed  two  new  techniques  for  acceler¬ 
ating  WR  convergence,  one  based  on  a  generalized 
conjugate-direction  method  and  the  other  based  on 
a  frequency-dependent  successive  over-relaxation 
(SOR)  algorithm.  Proofs  of  the  optimality  of  the 
SOR  algorithm  and  the  guaranteed  convergence  of 
the  C-D  method  have  been  completed,  and  exper¬ 
imental  results  indicate  that  both  methods  can 
reduce  the  number  of  waveform  iterations  as  much 
as  an  order  of  magnitude. 


1.2.4  Vision  Chip  Test  System 

In  order  to  test  VLSI  early  vision  processors  that 
are  being  developed  by  this  project,  it  is  important 
to  input  synthetic  data  or  real  image  data  to  the 
chip  and  study  processed  image  output  from  the 
chip.  Kamyar  Eshghi  and  Professor  Sodini  have 
constructed  a  Demonstration  System  for  Early 
Vision  processors  (DSEV)  that  will  perform  this 
high-level  testing.  DSEV  can  capture  images  from 
video  cameras  and  transmit  them  to  a  device  under 
test  (DUT).  Processed  images  retrieved  from  the 
DUT  are  stored  by  DSEV  and  may  be  displayed  for 
visual  comparison  to  the  input  image.  Also,  the 
same  images  that  were  input  to  the  DUT  may  be 
processed  by  software  algorithms  on  a  workstation 
and  compared  with  the  DUT  output,  allowing  the 
sensitivities  of  the  integrated  circuit  design  to  be 
studied.  DSEV  has  been  standardized  in  such  a 
way  that  a  minimum  of  effort  is  required  to  interface 
various  vision  chips  to  it. 

The  hardware  of  DSEV  consists  of  a  camera, 
display  monitor,  frame  grabber,  display  interface. 


image  memory,  SUN  workstation,  S-bus  to  VME 
adapter,  MaxBus  interface,  DUT  interface,  and  DUT 
board.  A  MaxVideo20  module  was  purchased  from 
DataCube  Incorporated,  which  includes  a  frame 
grabber,  display  interface  and  image  memory.  Soft¬ 
ware  has  been  written  to  control  MaxVideo20,  and 
the  MaxBus  interface  board  and  DUl  interface 
boards  were  designed,  built  and  tested. 

Each  student  will  have  to  design  and  build  his  or 
her  own  DUT  board  specifically  for  the  chip  to  be 
tested,  but  the  rest  of  the  system  is  shared,  at  a 
substantial  savings  in  overall  group  effort.  Ignacio 
McQuirk  is  now  modifying  and  altering  the  system 
for  use  in  testing  his  motion  chip.  He  is  currently 
moving  the  level-shifting  and  scaling  circuitry  from 
the  DUT  board  to  the  A/D  and  D/A  converter 
boards  and  building  more  of  those  boards. 


1.2.5  Novel  Camera  Calibration  Method 
Using  Images  of  Lines  and  Spheres 

A  stereo  vision  system  and  two  systems  for  camera 
motion  determination  were  described  earlier  in  this 
report.  Both  these  tasks  require  accurate  know¬ 
ledge  of  camera  parameters,  .  •♦'cally,  principal 
point,  focal  length  and  distortions  such  as  horizontal 
scaling  and  radial  distortion,  for  any  analog  or 
digital  implementation. 

Although  camera  calibration  is  important  for  3D 
measurements,  it  is  often  neglected  because  it  is  a 
laborious  process  and  requires  special  apparatus 
and  skill.  The  aim  of  this  project  is  to  devise  a 
method  for  calibration  which  will  be  simple  to 
perform  and  accurate  to  better  than  one  percent 
error  in  camera  parameters. 

Most  of  the  techniques  currently  used  in  the  field  of 
robotics  use  images  of  a  3D  object  whose  world 
coordinates  are  accurately  known  and  tfien  proceed 
to  find  both  the  camera  parameters  and  the  position 
and  orientation  of  the  camera  relative  to  the  3D 
object.  These  methods  have  a  few  inherent  weak¬ 
nesses.  It  is  often  hard  to  decouple  the  camera 
parameters  from  the  camera  orientation  parameters 
(external  parameters).  To  have  any  chance  of 
success,  a  very  precise  3D  object  is  required  with 
easily  identifiable  calibration  points.  These  objects 
are  relatively  hard  to  obtain,  and  the  number  of  cali¬ 
bration  points  in  the  object  is  small  (on  the  order  of 
100). 

Gideon  Stein  has  been  working  with  Professors 
Wyatt  and  Horn  to  develop  new  techniques  for  cali¬ 
bration  of  internal  camera  parameters  which  will  not 
require  finding  all  the  external  calibration  para¬ 
meters. 
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The  first  method  finds  the  radial  distortion  para¬ 
meters.  The  image  of  straight  lines  under  perspec¬ 
tive  projection  are  straight  lines.  In  real  images 
these  lines  are  curved  due  to  radial  distortion.  By 
correcting  for  the  radial  distortion  the  lines  can  be 
made  straight.  This  is  an  adaptation  of  the  "plumb 
line"  method  known  in  the  literature. 

The  second  method  finds  the  focal  length  and  the 
principal  point.  It  assumes  the  image  has  been  cor¬ 
rected  for  radial  distortion.  The  image  of  a  sphere 
under  perspective  projection  is  an  ellipse  whose 
major  axis  is  on  a  line  which  passes  through  the 
principal  point.  The  eccentricity  of  the  ellipse  is  a 
.unction  of  the  focal  length.  By  taking  the  image  of 
a  number  of  spheres  the  intersection  of  the  lines 
passing  through  the  major  axis  gives  the  principle 
point.  The  focal  length  can  be  found  from  the 
eccentricity.  This  method  finds  the  principal  point  to 
within  one  pixel  but  so  far  does  not  provide  accu¬ 
rate  estimates  of  the  focal  length. 

The  third  method  involves  rotating  the  camera 
around  the  y-axis  through  a  known  angle.  By 
tracking  points  in  the  scene  the  focal  length  and 
radial  distortion  can  be  recovered.  Currently  the 
estimates  are  repeatable  to  within  0.5  percent.  To  a 
lesser  degree  of  accuracy,  the  principal  point  can 
also  be  recovered.  This  method  is  suitable  for  self 
calibration  of  autonomous  robots  working  in 
unstructured  outdoor  environments. 

Future  work  involves  refining  these  three  methods 
and  evaluating  their  performance  compared  to  the 
more  traditional  methods. 


1.2.6  Analog  versus  Digital  Approaches  to 
Machine  Vision  Hardware 

Recent  technical  progress  in  analog  vision  chips 
has  stimulated  comparisons  of  analog  and  digital 
processing  schemes  through  case  studies.  The 
analog  vision  chips  have  demonstrated  their  small 
cost  in  silicon  area  and  high  speed  characteristics 
for  some  applications.  The  goal  of  this  project  is  to 
identify  the  capabilities  and  limitations  of  the  analog 
chips  and  to  find  what  application  areas  would  be 
suitable  for  analog,  digital,  and  hybrid  chips, 
respectively. 

In  collaboration  with  Professors  Charles  Sodini, 
Berthold  Horn,  and  John  Wyatt,  David  A.  Martin, 
under  the  supervision  of  Professor  Hae-Seung  Lee, 
and  Ichiro  Masaki,  visiting  scientist  from  General 
Motors,  chose  direct  image  correlation  and  three- 
dimensional  stereo  vision  algorithms  as  the  cases. 
These  algorithms  are  frequently  used  in  various 
applications  including  intelligent  vehicles  and  flex¬ 
ible  manufacturing  systems. 


As  the  first  step,  two  algorithms  were  compared  at 
a  conceptual  level:  a  binary  algorithm  for  digitu 
implementation  and  a  multi-valued  algorithm  for 
analog  chips.  The  computer  simulation  is  being 
developed  for  this  algorithm  level  comparison.  The 
chip  implementation  level  comparison  is  also 
undenway.  Currently  a  Multi-Instruction  Multi-Data 
Array  Processor  architecture,  implemented  in  both 
analog  and  digital,  is  being  studied.  Each  pro¬ 
cessing  element  includes  an  arithmetic  unit,  a  data 
flow  control  unit,  and  a  system  control  unit  which 
includes  program  memory. 

Our  next  goals  include  finalization  of  the  chip  archi¬ 
tecture,  design  of  the  chips,  and  performance  com¬ 
parisons  based  on  simulations  of  the  designed 
chips.  The  criteria  for  the  comparisons  will  include 
cost  in  silicon  area,  processing  speed,  manufac¬ 
turing  and  development  costs,  testability,  and  other 
related  issues. 


1.2.7  Solutions  of  Theoretical  Problems  in 
Analog  Hardware  and  Machine  Vision 

Gibbs  Random  Field  Models 

Gibbs  random  fields  (GRFs)  are  probabilistic 
models  inspired  by  statistical  mechanics  and  used 
to  model  images  in  computer  vision  and  image  pro¬ 
cessing.  In  this  research,  conducted  by  Ibrahim 
Elfadel  under  the  supervision  of  Professors  John 
Wyatt,  Berthold  Horn  and  Alan  Yuille,  the  analytical 
methods  of  statistical  mechanics  are  brought  to 
bear  on  these  models.  Specifically,  the  following 
fundamental  problems  have  been  addressed  and 
contributions  have  been  made  to  their  solutions: 

1.  Mean-field  estimation  of  a  constrained  GRF 
model:  The  configuration  space  of  a  GRF 
model  is  often  constrained  to  produce  "inter¬ 
esting"  patterns.  Mr.  Elfadel  has  developed 
mean-field  equations  for  estimating  the  means 
of  these  constrained  GRFs.  The  novel  feature 
of  these  equations  is  that  the  finiteness  of 
graylevels  is  incorporated  in  a  "hard"  way  in 
the  equations. 

2.  Correlation-field  estimation  of  a  GRF  model: 
GRF  correlation  functions  are  generally  hard  to 
compute  analytically  and  expensive  to  compute 
numerically.  Mr.  Elfadel  has  used  the  mean- 
field  equations  developed  above  to  propose  a 
new  procedure  for  estimating  these  correlation 
functions.  His  procedure,  which  is  valid  for 
both  unconstrained  and  constrained  models,  is 
applied  to  the  quadratic  interaction  model,  and 
a  new  closed-form  approximation  for  its  corre- 
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lation  function  in  terms  of  the  model  para¬ 
meters  is  derived. 

3.  Network  representation:  Mr.  Elfadel  has  shown 
how  the  mean-field  equations  of  the  GRF 
model  can  be  mapped  onto  the  fixed  point 
equations  of  an  analog  winner-take-all  (WTA) 
network.  The  building  block  of  this  network  is 
the  generalized  sigmoid  mapping,  a  natural 
generalization  of  the  sigmoidal  function  used  in 
artificial  neural  networks.  This  sigmoidal 
mapping  has  a  very  simple  VLSI  circuit  imple¬ 
mentation  with  desirable  circuit-theoretic  prop¬ 
erties  such  as  reciprocity  and  local  passivity. 

4.  Solution  algorithms:  Iterated  map  methods  and 
ordinary  differential  equations  (ODEs)  are  pro¬ 
posed  to  solve  the  network  fixed-point  equa¬ 
tions.  In  the  former,  Mr.  Elfadel  has  shown, 
using  Lyapunov  stability  theory  for  discrete 
systems,  that  the  worst  that  could  happen 
during  synchronous  iteration  is  an  oscillation  of 
period  2.  In  the  latter,  he  has  shown  that  the 
ODEs  are  the  gradient  descent  equations  of 
energy  functions  that  can  be  derived  from  the 
mean-field  approximation.  One  of  his  gradient 
descent  algorithms  can  be  interpreted  as  the 
generalization  to  analog  WTA  networks  of 
Hopfield's  well-known  algorithm  for  his  analog 
network. 

5.  Temperature  dependence:  The  GRF  temper¬ 
ature  parameter  reflects  the  thermodynamic 
roots  of  the  model.  Using  eigenstructure  anal¬ 
ysis,  he  has  studied  the  temperature  effect  on 
the  stability  of  the  WTA  network  fixed  points. 
In  particular,  he  derived  new  closed-form  for¬ 
mulas  for  the  critical  temperature  of  a  large 
class  of  models  used  in  grayscale  texture  syn¬ 
thesis.  The  stability  study  is  used  to  gain 
insight  into  the  phase  transition  behavior  of 
GRFs. 

The  technical  details  of  these  results  will  appear  in 
Ibrahim  Elfadel's  Ph.D.  thesis  titled  From  Random 
Fields  to  Networks,  which  was  submitted  in  Feb¬ 
ruary  1993. 

In  future  work,  Mr.  Elfadel  will  investigate  the  impli¬ 
cations  of  these  results  in  the  fields  of  image  mod¬ 
eling,  optimization,  and  analog  hardware 
implementation  of  image  p  ocessing  and  computer 
vision  algorithms. 


Switched-Capacitor  Network  Theory 

Mark  Seidel,  working  with  Professor  John  Wyatt, 
has  been  studying  general  networks  of  switches, 
capacitors,  and  sources.  These  networks  are 
important  in  the  implementation  of  computational 
and  image  processing  problems  formulated  as  min¬ 
imization  algorithms.  For  nonpathological  networks, 
conditions  for  stability  during  each  clock  phase 
have  been  proven.  Stability  for  the  discrete  time 
system  has  been  proven  for  a  large  subclass  of 
these  networks.  We  also  expect  to  prove  stability 
for  general  networks. 

Under  constant  input  conditions,  the  switched- 
capacitor  networks  can  be  shown  to  be  equivalent 
to  networks  comprised  of,  in  general,  nonreciprocal 
passive  resistive  multiports  and  multiterminal  ele¬ 
ments.  Each  such  element  is  associated  with 
exactly  one  of  the  capacitors  in  the  network.  The 
transformation  to  these  resistive  networks  is  equiv¬ 
alent  in  the  sense  that  all  node  voltages  are  the 
same,  and  all  currents  are  equal  to  the  corre¬ 
sponding  average  transferred  charge  over  the 
period.  This  equivalence  indicates  certain  rules  for 
the  reverse  problem  of  synthesizing  a  switched 
capacitor  network  from  a  resistive  network.  This 
synthesis  can  always  be  accomplished  for  networks 
of  two-terminal  resistors  and  lossy  two-port  transfor¬ 
mers.  It  has  also  been  shown  that  even  if  the 
resistive  network  to  be  implemented  includes  only 
the  nonreciprocal  passive  resistive  elements  dis¬ 
cussed  earlier,  clock  phase  assignment  is  some¬ 
times  impossible.  We  will  be  generating  several 
examples  and  writing  a  journal  article. 

Finally,  additional  work  has  been  completed  on 
bounding  the  discrete  time  system  settling  time  of 
large  switched  capacitor  networks.  The  bounds 
have  been  completely  inverted  and  can  be  shown 
to  be  equivalent  to  the  continuous  time  bounds  in 
the  appropriate  limit.  We  will  be  generating  several 
examples. 

Generating  a  Uniformly  Spaced  Set  of  3-D 
Rotations  for  Object  Recognition  Purposes 

In  some  pattern  recognition  tasks,  e.g.,  vehicle 
identification  from  aerial  photography,  one  wishes  to 
decide  which  object  from  a  finite,  previously  deter¬ 
mined  set  is  represented  in  some  portion  of  a  single 
image.  One  seeks  the  best  match  by  rotating  each 
of  the  known  objects  to  various  positions,  creating 
2-D  images  in  various  sizes,  and  finding  the  best 
match.  One  problem  is  that  there  is  no  known  way 
of  writing  down  closed  form  expressions  for  a  uni¬ 
formly  spaced  set  of  N  rotations  in  three  dimen¬ 
sions,  except  for  very  special  values  of  N 
corresponding  to  the  number  of  faces  on  a  regular 
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polyhedron.  (The  solution  in  two  dimensions  is 
trivial— the  rotations  are  through  angles  of 
(K/N)-360\  where  K  =  1,2 . N  -  1.) 

Gene  Osgood,  working  with  Professors  Wyatt  and 
Horn,  has  explored  an  iterative  method  for 
producing  uniformly  spaced  rotations  using  the 
quaternion  number  system.  In  this  system  each 
rotation  in  three  dimensions  is  represented 
abstractly  as  a  point  on  the  three-dimensional 
sphere  in  four  dimensions.  Gene  represents  each 
point  as  a  positive  point  charge  and  simulates  the 
movement  of  these  charges  under  mutual  repulsion 
as  they  spread  apart  on  the  surface  of  the  sphere. 
The  method  yields  a  substantial  savings  in  com¬ 
puter  time  over  conventional  approaches  that  use 
unevenly  spaced  points. 
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Enormous  computational  expense  and  the  growing 
importance  of  mixed  circuit/device  simulation,  as 
well  as  the  increasing  availability  of  parallel  com¬ 
puters,  suggest  that  specialized,  easily  parallelized 
algorithms  should  be  developed  for  transient  simu¬ 
lation  of  MOS  devices.  In  earlier  work  on  the 
WORDS  program  (Waveform  Overrelaxation  Device 
Simulator),  the  easily  parallelized  waveform  relaxa¬ 
tion  (WR)  algorithm  was  shown  to  be  a  computa¬ 
tionally  efficient  approach  to  device  transient 
simulation  even  on  a  serial  machine.  However,  the 
WR  algorithm  typically  requires  hundreds  of  iter¬ 
ations  to  achieve  an  accurate  solution. 

In  order  to  use  WORDS  in  a  mixed  circuit/device 
simulator,  we  have  been  investigating  ways  of 
making  WORDS  more  robust  and  efficient.  We 
determined  how  to  compute  the  terminal  currents 
accurately  using  different  timepoints  at  different 
mesh  nodes.  We  also  improved  the  timestep  selec¬ 
tion  procedure  by  determining  how  to  refine  the 
timesteps  as  WR  iterations  proceed  (reducing  the 
total  computation  by  as  much  as  a  factor  of  2  by 
using  only  a  few  coarse  timesteps  in  early  iter¬ 
ations).  The  more  accurate,  electric  field  depen¬ 
dent  mobility  model  was  also  implemented.  Recent 
work  on  theoretical  aspects  of  these  methods  have 
answered  several  long-standing  questions  about 
muitirate  stability.® 


8  M.  Reichelt,  F.  Odeh,  and  J.  White,  "A-Stability  of  Multirate  Integration  Methods,  with  Application  to  Parallel  Semiconductor  Device 
Simulation,"  (Invited  paper)  Proceedings  of  the  SIAM  Meeting  on  Parallel  Processing  for  Scientific  Computing,  Norfolk,  Virginia, 
March  1993. 
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Through  experiments,  we  found  evidence  that  WR, 
using  standard  overrelaxation  acceleration,  can 
produce  oscillatory  results  and  are  investigating 
methods  for  eliminating  this  phenomenon.  A 
frequency-dependent  overrelaxation  algorithm  using 
lowpass  filtering  was  developed®  as  well  as  a  wave¬ 
form  conjugate-direction  approach.'®  Experimental 
results  indicate  that  both  approaches  reduce  the 
number  of  waveform  iterations  required  by  more 
than  a  factor  of  seven.  Finally,  experimental  results 
using  a  parallel  computer  show  that,  although  the 
accelerated  WR  methods  are  only  competitive  with 
the  best  of  the  standard  algorithms  for  device  tran¬ 
sient  simulation  on  a  serial  machine,  the  WR  algo¬ 
rithms  are  substantially  faster  on  a  parallel 
machine." 


1.4  Numerical  Simulation  of  Short 
Channel  MOS  Devices 

Sponsors 

IBM  Corporation 
National  Science  Foundation 
Fellowship  MIP  88-58764 
U.S.  Navy 

Contract  N00174-92-Q-0133 

Project  Staff 

Khalid  Rahmat,  Professor  Dimitri  A.  Antoniadas, 
Professor  Jacob  K.  White 

The  model  used  in  conventional  device  simulation 
programs  is  based  on  the  drift-diffusion  model  of 
electron  transport,  but  this  model  does  not  accu¬ 
rately  predict  the  field  distribution  near  the  drain  in 
small  geometry  devices.  This  is  of  particular  impor¬ 
tance  for  predicting  oxide  breakdown  due  to  pene¬ 
tration  by  "hot"  electrons.  There  are  two 
approaches  for  more  accurately  computing  the  elec¬ 
tric  fields  in  MOS  devices,  one  is  based  on  adding 
an  energy  equation  to  the  drift-diffusion  model,  and 
the  second  is  based  on  direct  solution  of 
Boltzman's  equation. 


In  the  first  approach,  an  energy  balance  equation  is 
solved  along  with  the  drift-diffusion  equations  so 
that  the  electron  temperatures  are  computed  accu¬ 
rately.  This  combined  system  is  numerically  less 
tame  than  the  standard  approach  and  must  be 
solved  carefully.  We  have  developed  a  two- 
dimensional  device  simulator  in  which  an  energy 
balance  equation  is  solved  for  electron  temperature 
along  with  the  usual  drift-diffusion  equations.  The 
program  avoids  temperature  instabilities  produced 
by  previous  discretization  approaches  through  the 
use  of  a  careful  application  of  exponential-fitting  to 
the  energy  equation.  Drain  cu!'rents  for  silicon 
MOSFETs  predicted  by  the  simulator,  using  one  set 
of  model  parameters,  match  well  with  experimental 
data  for  devices  over  a  range  of  channel  lengths 
from  0.90  urn  to  0.16  urn.  Also,  a  method  to 
compute  substrate  current  has  been  derived  which 
uses  the  electron  temperature  provided  by  the  sim¬ 
ulator.  The  computed  substrate  currents  match  well 
with  measured  data,  for  the  regime  above  sub¬ 
threshold,  for  MOSFET's  with  channel  lengths  as 
short  as  0.16  nm.'^ 

The  limited  ability  to  accurately  predict  hot-electron 
effects  from  an  energy-balance  based  simulator  has 
led  to  our  current  investigation.  We  are  now  trying 
to  solve  the  full  Boltzman  equation,  using  a  spher¬ 
ical  harmonics  based  approach  in  portions  of  the 
MOS  device  which  are  likely  to  create  hot  carriers. 


1.5  Circuit  Simulation  Algorithms 

Sponsors 

IBM  Corporation 
National  Science  Foundation 
Fellowship  MIP  88-58764 

Project  Staff 

Luis  M.  Silveira,  Steven  B.  Leeb,  Professor  Jacob 
K.  White 

A  challenging  problem  in  the  area  of  analog  circuits 
is  the  simulation  of  clocked  analog  circuits  like 


9  M.  Reichelt,  J.  White,  and  J.  Allen,  "Frequency-Dependent  Waveform  Overrelaxation  for  Transient  Two-Dimensional  Simulation  of 
MOS  Devices,"  Proceedings  of  NUPAD  IV,  Seattle,  Washington,  May  1992,  pp.  161-166. 

'0  A.  Lumsdaine  and  J.  White,  "Accelerating  Dynamic  Iteration  Methods  with  Application  to  Semiconductor  Device  Simulation,"  (Third 
place,  best  student  paper  competition)  Proceedings  of  the  Copper  Mountain  Conference  on  Iterative  Methods.  Copper  Mountain, 
Colorado,  April  1992. 

"  A.  Lumsdaine  and  M.  Reichelt,  "Waveform  Iterative  Techniques  for  Device  Transient  Simulation  on  Parallel  Machines,"  Invited  paper. 
Proceedings  of  the  SIAM  Meeting  on  Parallel  Processing  for  Scientific  Computing,  Norfolk,  Virginia,  March  1 993. 

'2  K.  Rahmat,  J.  White,  and  D.  Antoniadis,  "Simulation  of  Very  Short  Channel  MOSFETs  Including  Energy  Balance,"  IEEE  Trans. 
Comput. -Aided  Des.,  forthcoming. 
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switching  filters,  switching  power  supplies,  and 
phase-locked  loops.  These  circuits  are  computa¬ 
tionally  expensive  to  simulate  using  conventional 
techniques  because  these  kinds  of  circuits  are  all 
clocked  at  a  frequency  whose  period  is  orders  of 
magnitude  smaller  than  the  time  interval  of  interest 
to  the  designer.  To  construct  such  a  long  time  sol¬ 
ution,  a  program  like  SPICE  or  ASTAP  must  calcu¬ 
late  the  behavior  of  the  circuit  for  many  high 
frequency  clock  cycles.  The  basic  approach  to 
making  simulation  of  these  circuits  more  efficient  is 
to  exploit  only  the  property  that  the  similar  behavior 
of  such  a  circuit  in  a  given  high  frequency  clock 
cycle,  but  not  identical,  to  the  behavior  in  the  pre¬ 
ceding  and  following  cycles.  Therefore,  by  accu¬ 
rately  computing  the  solution  over  a  few  selected 
cycles,  an  accurate  long  term  solution  can  be  con¬ 
structed.  Such  approaches  are  known  as 
"envelope-following"  algorithms. 

In  our  recent  work,  we  are  trying  to  make  the 
envelope-following  algorithm  more  robust  and  effi¬ 
cient  by  exploiting  the  fact  that  the  envelope  of 
"quasi-algebraic"  components  in  the  solution  vector 
need  not  be  computed.  An  automatic  method  for 
determining  the  quasi-algebraic  solution  compo¬ 
nents  has  been  derived,  and  experimental  results 
demonstrate  that  this  modified  method  reduces  the 
number  of  computed  clock  cycles  when  applied  to 
simulating  closed-loop  switching  power  converters.'^ 

1.6  Parallel  Circuit  Simulation 
Algorithms 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N0001 4-91 -J-1 698 

Project  Staff 

Luis  M.  Silveira,  Ricardo  Telichevsky,  Professor 
William  J.  Dally,  Professor  Jacob  K.  White 

We  are  taking  an  application-oriented  approach  to 
developing  parallel  numerical  algorithms  and 
focusing  on  circuit  and  device  simulation.  Applica¬ 
tion  programs  and  techniques  are  being  developed 
for  both  massively  parallel  machines  SIMD 
machines  like  the  Connection  Machine  or  MIMD 
machines  like  the  Intel  hypercube.  In  addition,  we 


are  also  trying  to  understand  fundamental  aspects 
of  the  interaction  between  architecture  and  certain 
numerical  algorithms. 

For  example,  the  direct  solution  of  circuit  simulation 
matrices  is  particularly  difficult  to  parallelize,  in  part 
because  methods  like  parallel  nested  dissection  are 
ineffective  due  to  the  difficulty  of  finding  good  sepa¬ 
rators.  For  that  reason,  general  sparse  matrix  fac¬ 
torization  techniques  are  being  studied  and,  in 
particular,  the  interaction  between  sparse  matrix 
data  structures,  computer  memory  structure,  and 
multiprocessor  communication  is  being  investigated. 
To  focus  this  work,  a  special-purpose  processor  for 
circuit  simulation,  the  Numerical  Engine  (NE),  is 
under  development.  Preliminary  design  is  com¬ 
plete,  and  register-transfer  level  simulation  results 
indicate  that  the  specialized  processor  can  achieve 
up  to  80%  of  its  peak  floating-point  performance  for 
sparse  matrix  factorization  and  nearly  90%  of  its 
peak  performance  on  model  evaluation. 

Explicit  integration  methods  avoid  matrix  solution 
and,  therefore,  are  also  interesting  algorithms  to 
use  on  parallel  processors.  For  this  reason,  we 
investigated  some  properties  of  the  recently  devel¬ 
oped  explicit  exponentially-fit  integration  algorithms. 
The  results  were  not  very  encouraging,  although 
our  theoretical  investigation  yielded  several  new 
insights  about  these  methods.’*' 


1.7  Microelectromechanical  CAD 
(MEMCAD) 

Sponsors 

Federal  Bureau  of  Investigation 
Contract  J-FBI-92-196 
National  Science  Foundation 
Fellowship  MIP  88-58764 

Project  Staff 

Xuejun  Cai,  Mattan  Kamon,  Professor  Martin  A. 
Schmidt,  Professor  Stephen  D.  Senturia,  Professor 
Jacob  K.  White 

High  fabrication  costs  and  increasing  microsensor 
complexity  is  making  computer  simulation  of  the 
realistic  geometries  necessary,  both  to  investigate 
design  alternatives  and  to  perform  verification 
before  fabrication.  At  MIT,  we  are  developing  a 


'3  M.  Silveira,  J.  White,  and  S.  Leeb,  "A  Modified  Envelope-Following  Approach  to  Clocked  Analog  Circuit  Simulation,"  Proceedings  of 
the  International  Conference  on  Computer-Aided  Design,  Santa  Clara,  California,  November  1991,  pp.  20-23. 
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11(5);  566-574  (1992). 
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MicroElectroMechanical  Computer-Aided  Design 
(MEMCAD)  system  to  make  it  possible  for  micro¬ 
sensor  designers  to  easily  perform  realistic  simu¬ 
lations.  Carefully  selected  commercial  software 
packages  have  been  linked  with  specialized  data¬ 
base  and  numerical  programs  to  allow  a  designer  to 
easily  enter  a  three-dimensional  microsensor  geom¬ 
etry  and  quickly  perform  both  mechanical  and  elec¬ 
trical  analyses.  The  system  currently  performs 
electromechanical  analyses,  such  as  calculating  the 
capacitance  versus  pressure  (or  force)  curve  for  a 
square  diaphragm  deformed  by  a  differential  pres¬ 
sure  and  can  be  used  to  calculate  levitation  forces 
in  structures  as  complicated  as  a  comb  drive.'® 

1.8  Numerical  Techniques  for 
Simulating  Josephson  Junction  Arrays 

Sponsors 

National  Defense  Science  and  Engineering 
Graduate  Fellowship 
National  Science  Foundation 
Fellowship  MIP  88-58764 

Project  Staff 

Joel  R.  Phillips,  Herre  S.J.  van  der  Zant,  Professor 
Terry  P.  Orlando,  Professor  Jacob  K.  White 

Vortices  play  a  central  role  in  determining  the  static 
and  dynamic  properties  of  two-dimensional  (2D) 
superconductors.  Artificially  fabricated  networks  of 
superconducting  islands  weakly  coupled  by 
Josephson  junctions  are  model  systems  to  study 
the  behavior  of  vortices.  Through  simulation,  we 
have  discovered  that  the  static  properties  of 
vortices  in  an  array  of  Josephson  junctions  can  be 
significantly  influenced  by  magnetic  fields  induced 
by  vortex  currents.  The  energy  barrier  for  vortex 
motion  is  enhanced,  nearly  doubling  for  penetration 
depths  on  the  order  of  a  cell  size.  Moreover,  we 
have  found  that  correct  calculation  of  the  vortex 
current  distribution,  the  magnetic  moment,  and  the 
lower  critical  field  require  modeling  mutual  induc¬ 
tance  interactions  between  all  cell  pairs  in  the  array. 


To  make  numerical  simulation  of  the  system  with  all 
inductive  effects  computationally  feasible,  a  novel 
FFT-accelerated  integral  equation  solver  was 
derived.  This  algorithm  is  sufficiently  efficient  to 
allow  study  of  large  (500  x  500  cells)  arrays.’® 


1.9  Efficient  3-D  Interconnect  Analysis 

Sponsors 

Digital  Equipment  Corporation 
IBM  Corporation 
National  Science  Foundation 
Fellowship  MIP  88-58764 
U.S.  Navy  -  Office  of  Naval  Research 
Contract  NOO0 1 4-9 1-J- 1698 

Project  Staff 

Mattan  Kamon,  Songmin  Kim,  Keith  S.  Nabors,  Joel 
R.  Phillips,  Professor  Jacob  K.  White 

We  have  developed  multipole-accelerated  algo¬ 
rithms  for  computing  capacitances  and  inductances 
of  complicated  3-D  geometries  and  implemented 
these  algorithms  in  the  programs  FASTCAP  and 
FASTHENRY.  The  methods  are  accelerations  of 
the  boundary-element  or  method-of-moments  tech¬ 
niques  for  solving  the  integral  equations  associated 
with  the  multiconductor  capacitance  or  inductance 
extraction  problem.  Boundary-element  methods 
become  slow  when  a  large  number  of  elements  are 
used  because  they  lead  to  dense  matrix  problems 
which  are  typically  solved  with  some  form  of 
Gaussian  elimination.  This  implies  that  the  compu¬ 
tation  grows  as  n®,  where  n  is  the  number  of  panels 
or  tiles  needed  to  accurately  discretize  the  con¬ 
ductor  surface  charges.  Our  new  algorithms,  which 
use  generalized  conjugate  residual  iterative  algo¬ 
rithms  with  a  multipole  approximation  to  compute 
the  iterates,  reduces  the  complexity  so  that  accu¬ 
rate  multiconductor  capacitance  and  inductance  cal¬ 
culations  grow  nearly  as  nm  where  m  is  the  number 
of  conductors.  For  practical  problems  which  require 
as  many  as  10,000  panels  or  filaments,  FASTCAP 
and  FASTHENRY  are  more  than  two  orders  of 
magnitude  faster  than  standard  boundary-element 


15  S.D.  Senturia,  R.M,  Harris,  B.P.  Johnson,  S.  Kim,  K.  Nabors,  M.A.  Shulman,  and  J.K.  White,  "A  Computer-Aided  Design  System  for 
Microelectromechanical  Systems  (MEMCAD),"  IEEE  J.  Microelectromech.  Sysl.  1(1):  3-13  (1992);  J.  R.  Gilbert,  P.M.  Osterberg, 
R.M.  Harris,  D.O.  Ouma,  X.  Cai,  A.  Pfajfer,  J.  White,  and  S.D.  Senturia,  "Implementation  of  a  MEMCAD  System  for  Electrostatic  And 
Mechanical  Analysis  of  Complex  Structures  From  Mask  Descriptions,"  Proceedings  of  the  IEEE  Micro  Electro  Mechanical  Systems 
Conference,  Fort  Lauderdale,  Florida,  February  1993. 
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Junction  Arrays,"  Phys.  Rev.  B.  Forthcoming. 
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based  programs.'^  Manuals  and  source  code  for 
FASTCAP  and  FASTHENRY  are  available  directly 
from  MIT. 


1.10  Techniques  for  Logic  Synthesis, 
Format  Verification  and  Testing 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Contract  N001 4-91 -J-1 698 
IBM  Corporation  Grant/Fellowship 

Project  Staff 

Professor  Srinivas  Devadas,  Mazhar  M.  Alidina, 
Vishal  L.  Bhagwati,  Kelly  S.  Bai,  Stan  Y.  Liao,  Jose 
C.  Monteiro,  Amelia  H.  Shen,  Dr.  Filip  J.  Van  Aelten 


1.10.1  Introduction 

The  design  of  microelectronic  integrated  circuits  is  a 
complex  process  because  the  final  product  must 
meet  many  different  requirements  and  specifica¬ 
tions.  Specifications  concerning  the  size  and  per¬ 
formance  of  a  circuit  are  invariably  placed  on  the 
design  process.  For  example,  reliability  restrictions 
requiring  that  a  circuit  be  testable  for  a  chosen  fault 
model  place  limitations  on  the  design  process. 
With  an  increasing  number  of  transistors  being  fab¬ 
ricated  within  a  small  area  on  a  silicon  substrate, 
the  power  consumption  of  a  circuit  has  become  a 
very  important  design  parameter.  Indeed,  within 
the  next  decade,  the  power  consumed  by  a  circuit 
may  soon  become  the  fundamental  constraint  lim¬ 
iting  the  density,  i.e.,  transistors  per  unit  area,  of 
integrated  circuits.  At  MIT,  we  are  working  on 
various  aspects  of  the  problem  of  designing 
integrated  circuits  for  low  power  consumption. 
Some  solutions  include  efficient  power  estimation 
methods  as  well  as  automated  methods  for  logic 
optimization  directly  targeting  low  power  consump¬ 
tion  in  the  optimized  circuit. 


In  addition,  a  circuit  being  designed  must  be 
checked  for  correct  functionality  and  must  meet  per¬ 
formance  goals.  Formal  verification  methods  can 
guarantee  correctness  of  the  design — however,  a 
successful  and  practical  method  must  be  efficient 
and  use  a  realistic  circuit  model.  We  are 
addressing  various  verification  problems  at  the 
architectural,  logic,  and  circuit  levels.  These  prob¬ 
lems  include  verification  of  pipelines,  combinational 
logic  verification,  and  asynchronous  circuit  verifica¬ 
tion.  Verifying  asynchronous  circuits  is  particularly 
difficult  since  a  sequential  circuit  must  be  directly 
checked  for  correct  functionality,  while  in  the  syn¬ 
chronous  case  one  can  separate  the  combinational 
logic  of  the  synchronous  circuit  from  the  memory 
elements.  For  instance,  verifying  the  functionality  of 
an  asynchronous  circuit  requires  timing  simulation 
on  the  combinational  logic  to  identify  the  presence 
of  hazards  that  might  cause  its  malfunction. 

In  the  following  sections,  we  will  describe  our  recent 
work  in  the  areas  of  design  for  low  power  consump¬ 
tion  and  formal  verification  techniques. 


1.10.2  Design  for  Low  Power  Consumption 

Probabilistic  Power  Estimation 

We  have  addressed  the  problem  of  estimating  the 
average  switching  activity  of  VLSI  sequential  cir¬ 
cuits  under  random  input  sequences.  We  have 
developed  a  model  that  can  be  used  to  gauge  the 
power  dissipation  of  the  sequential  circuit  and  to 
make  architectural  or  design-style  decisions  during 
the  VLSI  synthesis  process. 

Because  switching  activity  is  strongly  affected  by 
gate  delays,  we  have  used  a  general  delay  model 
in  estimating  switching  activity.  This  model  con¬ 
siders  glitching  at  gate  outputs  in  the  circuit  while 
modeling  the  inertial  delay  of  a  gate.  Our  method 
considers  correlation  caused  at  internal  gates  in  a 
circuit  due  to  reconvergence  of  input  signals.  In 
combinational  circuits,  uncorrelated  input  patterns 
with  uniform  switching  rates  are  typically  assumed, 
simplifying  the  problem  of  switching  activity  esti- 


'7  K,  Nabors,  S.  Kim,  and  J.  White,  "Fast  Capacitance  Extraction  of  General  Three-Dimensional  Structures,”  IEEE  Trans.  Microwave 
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Methods  lor  Three-Dimensional  Potential  Integral  Equations  of  the  First  Kind,"  SIAM  J.  Sci.  Slat.  Comp.  Forthcoming;  K.  Nabors, 
and  J.  White,  "Multipole-Accelerated  3-D  Capacitance  Extraction  Algorithms  for  Structures  with  Conformal  Dielectrics,"  Proceeding  of 
the  29th  Design  Automation  Conference,  Anaheim,  California,  June  1992,  pp.  710-715;  M.  Kamon,  M.  Tsuk,  C.  Smithhisler,  and  J. 
White,  "Efficient  Techniques  tor  Inductance  Extraction  of  Complex  3-D  Geometries,"  Proceedings  of  the  International  Conference  on 
Computer-Aided  Design,  Santa  Clara,  California,  November  1992,  pp.  438-442;  J.R.  Phillips,  M.  Kamon,  and  J.  White,  "An 
FFT-Based  Approach  to  Including  Non-ideal  Ground  Planes  in  a  Fast  3-D  Inductance  Extraction  Program,"  Proceedings  of  the 
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3-D  Interconnect  Analysis,"  (Invited  paper)  Proceedings  of  the  International  Workshop  on  VLSI  Process  and  Device  Modeling 
{VP AD),  Nara,  Japan,  May  1993. 
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mation.  However,  in  sequential  circuits  the  input 
sequence  applied  to  the  combinational  portion  of 
the  circuit  is  highly  correlated  because  some  of  the 
inputs  to  the  combinational  logic  are  flip-flop  outputs 
representing  the  state  of  the  circuit.  We  have 
developed  approximate  methods  to  probabilistically 
estimate  switching  activity  in  sequential  circuits  that 
automatically  compute  switching  rates  and  corre¬ 
lations  between  flip-flop  outputs. 

Ongoing  work  includes  relaxing  our  current 
assumptions  in  sequential  circuit  power  estimation 
and  improving  the  efficiency  of  our  algorithms. 

Combinational  Logic  Synthesis 

We  are  also  investigating  the  use  of  retiming 
methods  to  improve  power  dissipation.  In  our 
experiments  with  probabilistic  power  analysis,  we 
noted  that  power  consumption  depended  signif¬ 
icantly  on  the  gate-level  structure  of  a  combina¬ 
tional  or  sequential  circuit.  Then  we  became 
interested  in  gauging  exactly  what  logic  structures 
corresponded  to  low  power  realizations.  With  this 
information  we  can  tailor  a  logic  optimization 
strategy  to  produce  circuit  topologies  that  consume 
the  least  amount  of  power. 

We  have  developed  several  strategies  to  optimize  a 
combinational  circuit  to  lower  its  power  consump¬ 
tion:  (^)  timing  optimization  and  Vod  scaling,  (2) 
selective  collapsing,  (3)  don't-care-based  minimiza¬ 
tion,  and  (4)  implementation  as  a  disjoint  cover. 

In  Method  1 ,  the  circuit  is  restructured  for  maximum 
speed,  and  then  the  power  supply  voltage  Vdd  is 
lowered  while  increasing  the  delay  of  the  circuit  to 
the  maximum  allowed  delay.  Power  consumption 
for  the  reduced  Vod  circuit  is  lowered  because 
power  is  proportional  to  Vdd^-  In  Method  2,  the 
delay  of  the  circuit  is  maintained  constant,  but 
portions  of  the  circuit  that  are  not  on  the  critical 
path  are  collapsed  into  two  levels  of  logic,  reducing 
power  dissipation.  In  Method  3,  observability  don't- 
cares  are  used  to  minimize  the  internal  gates  in  the 
circuit.  Minimization  alters  the  signal  probabilities  at 
the  gate  outputs  in  a  manner  such  that  power  con¬ 
sumption  is  reduced.  In  Method  4,  the  circuit  is 
implemented  as  a  disjoint  two-level  cover,  where 
each  AND  gate  is  disjoint  from  all  other  AND 
gates— this  is  a  minimal  power  realization. 

Experimental  results  on  a  variety  of  examples  are 
extremely  encouraging.  Method  2  resulted  in  the 
circuits  with  the  lowest  power  dissipation. 


Sequential  Logic  Synthesis 

We  are  exploring  methods  to  optimize  sequential 
circuits  for  low  power  consumption.  In  particular, 
we  are  developing  methods  for  the  state  assign¬ 
ment  of  finite  state  machines  »hat  result  in  low 
power  dissipation.  For  instance,  if  a  state  S2  can  be 
reached  by  many  input  vectors  from  state  St,  to 
minimize  power,  a  good  heuristic  is  to  assign 
Si  and  S2  uni-distant  codes.  The  combinational 
logic  of  the  state-encoded  machine  can  be  opti¬ 
mized  using  techniques  described  in  the  previous 
section  to  obtain  a  minimal  power  circuit  realization. 


1.10.3  Formal  Verification  Techniques 
Pipeline  Verification 

It  is  difficult  to  verify  a  behavioral  specification 
against  a  logic-level  implementation  since  the  logic- 
level  implementation  can  be  serial,  parallel,  or  pipe¬ 
lined  implementation  of  the  behavioral  description. 
We  are  investigating  automatic  methods  for  veri¬ 
fying  pipelined  implementations  against  unpipelined 
specifications.  These  methods  ensure  that  each 
data  transfer  taking  place  upon  the  execution  of 
any  instruction  in  the  unpipelined  circuit  also  occurs 
in  the  pipelined  circuit.  In  the  case  where  data 
dependence  hazards  exist,  verification  requires 
checking  that  pipelined  circuit  contains  bypass 
logic,  as  well  as  checking  that  this  logic  results  in 
data  transferring  correctly.  Similarly,  branch 
instructions  may  require  purging  of  the  pipeline  or 
disabling  of  register/memory  writes. 

A  symbolic  simulation  method  is  being  developed 
that  will  efficiently  verify  pipelined  (A-processors 
against  instruction  set  specifications. 

Free  Binary  Decision  Diagrams 

We  are  developing  a  set  of  algorithms  for  (1)  the 
probabilistic  construction  of  free  Binary  Decision 
Diagrams  from  multilevel  combinational  logic  cir¬ 
cuits  and  (2)  the  Boolean  manipulation  of  free 
Binary  Decision  Diagrams. 

With  the  aid  of  a  previously  shown  result,  namely 
that  the  equivalence  of  two  free  Binary  Decision 
Diagrams  can  be  decided  probabilistically  in 
polynomial  time,  we  have  developed  a  set  of 
polynomial-time  algorithms  to  reduce,  i.e.,  remove 
redundant  nodes  and  subfunctions  from  a  free 
Binary  Decision  Diagram.  The  algorithms  have  a 
time  complexity  of  0(n''|G|log(|G|))  where  n  is  the 
number  of  primary  inputs  to  the  free  Binary  Deci¬ 
sion  Diagram  G,  |G|  is  the  number  of  nodes  in  G, 
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and  kaO  is  a  constant.  We  have  shown  that 
reduced,  ordered  Binary  Decision  Diagram  con¬ 
struction  methods  that  exploit  the  strongly  canonical 
form  can  be  modified  to  apply  to  free  Binary  Deci¬ 
sion  Diagrams. 

A  concat  algorithm  that  computes  a  free  Binary 
Decision  Diagram  corresponding  to  the  Boolean 
AND  of  two  free  Binary  Decision  Diagrams  has 
been  developed  and  is  currently  being  imple¬ 
mented.  This  concat  algorithm  along  with  the 
k-reduce  algorithm  provides  the  basis  for  a  Boolean 
manipulation  package  that  can  be  used  to  solve 
many  verification  and  testing  problems. 

Asynchronous  Circuit  Verification 

We  have  addressed  the  problem  of  verifying  that 
the  gate-level  implementation  of  an  asynchronous 
circuit,  with  given  or  extracted  bounds  on  wire  and 
gate  delays,  is  equivalent  to  a  specification  of  the 
asynchronous  circuit  behavior  described  as  a  clas¬ 
sical  flow  table. 

We  have  developed  a  procedure  to  extract  the 
complete  set  of  possible  flow  tables  from  a  gate- 
level  description  of  an  asynchronous  circuit  under 
fixed  or  bounded  wire  delay  models.  Given  an 
extracted  flow  table  and  an  initial  flow  table  specifi¬ 
cation,  we  have  implemented  procedures  to  con¬ 
struct  a  product  flow  table  to  check  for  machine 
equivalence,  under  various  modes  of  operation. 

We  have  considered  fundamental  and  non¬ 
fundamental  modes  of  operation.  We  have  taken 
into  account  single  output  change  and  multiple 
output  change  flow  tables  as  well  as  single  input 
change  and  multiple  input  change  flow  tables.  Flow 
table  extraction  and  equivalence  checking  proce¬ 
dures  have  been  tailored  for  each  mode  of  opera¬ 
tion  and  each  type  of  asynchronous  flow  table. 

Under  the  discretized  bounded  delay  model,  where 
each  gate  can  take  on  integer  delay  values  within  a 
range,  we  have  developed  implicit  enumeration 
methods  for  verification.  These  Binary  Decision 
Diagram-based  methods  do  not  require  explicit 
representation  of  each  possible  flow  table  corre¬ 
sponding  to  the  circuit  with  bounded  gate  delays  or 
explicit  enumeration  of  states  in  the  circuit.  These 
methods  are  more  efficient  than  the  corresponding 
ones  for  a  continuous  delay  model. 
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2.1  Introduction 
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Computer-Aided  Fabrication  Environment  (CAFE)  is 
a  software  system  being  developed  at  MIT  for  use 
in  the  fabrication  of  integrated  circuits  (iC)  and 
microstructures.  The  distinguishing  feature  of 
CAFE  is  that  it  can  be  used  in  all  phases  of 
process  design,  development,  planning,  and  manu¬ 
facturing  of  integrated  circuit  wafers.  CAFE  pres¬ 
ently  provides  day-to-day  support  to  research  and 
production  facilities  at  MIT  with  both  flexible  and 
standard  product  capabilities.  This  manufacturing 
software  system  is  unique  because  a  process  flow 
representation  (PFR)  has  been  developed  and 
integrated  into  actual  fabrication  operations.  CAFE 
provides  a  platform  for  work  in  several  active 
research  areas  including  "technology"  (process  and 
device)  computer-aided  design  (TCAD),  process 
modeling,  manufacturing  quality  control,  and  sched¬ 
uling. 


2.1.1  Architecture 

The  CAFE  architecture  is  a  computer  integrated 
manufacturing  (CIM)  framework  for  the  deployment 
and  integration  of  integrated  cr.uit  and  process 
design  and  manufacturing  software.  CAFE  uses  an 
object-oriented  database  model  which  is  imple¬ 
mented  in  a  layered  manner  on  top  of  a  relational 
database.  Our  database  schema  is  based  on 


GESTALT,  an  object-oriented,  extensible  data 
model.  GESTALT  is  a  layer  of  abstraction  which 
maps  user  defined  objects  onto  existing  database 
systeins  (e.g.,  a  relational  database  management 
system)  while  shielding  application  programs  from 
the  details  of  the  underlying  database.  The  inte¬ 
gration  architecture  includes  conceptual  schema 
and  models  used  to  represent  the  integrated  circuits 
manufacturing  domain  in  CAFE  and  user  and 
programmatic  interfaces  to  the  various  applications. 
Two  important  CAFE  applications  relate  process 
simulation  and  actual  wafer  fabrication  to  the  same 
PFR. 


2.1.2  CAFE  Applications 

The  fabrication  of  wafers  with  a  process  repres¬ 
ented  as  a  PFR  involves  several  steps.  A  suitable 
PFR  for  the  specific  process  must  be  created  and 
installed.  Then  wafer  lots  must  be  created  and 
associated  with  this  specific  PFR.  These  lots  must 
then  be  "started"  to  create  a  task  data  structure 
which  is  isomorphic  to  the  hierarchical  structure  of 
the  PFR. 

At  this  point,  actual  machine  operations  can  be 
scheduled  and  reservations  made  for  both 
machines  and  operators.  Finally,  machine  oper¬ 
ations  can  be  performed,  instructions  given  to  the 
operator  and  machines,  and  data  collected  from  the 
operator  or  machine  and  entered  into  the  database. 

Current  work  involves  implementing  hierarchical 
scheduling  and  using  PFR  for  real  time  process 
control  and  integration  of  a  mechanical  TCAD 
system. 
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2.2  Principal  Objectives 

The  primary  objectives  for  1992  were  (1)  evaluation 
of  the  results  of  the  initial  experiments  with  fabri¬ 
cation  via  PFR,  (2)  development  of  applications  and 
improvements  responsive  to  the  initial  users'  com¬ 
ments  and  suggestions,  (3)  integration  of  these 
improvements  into  CAFE,  and  (4)  continued  use  of 
PFR  fabrication  for  lots  using  the  CMOS  baseline 
process.  A  fifth  objective  was  to  develop  an  appli¬ 
cation  to  serve  as  the  initial  integration  of  the  Run- 
by-Run  (RbR)  control  project  into  the  CAFE 
software  system. 


2.3  Principal  Accomplishments 


2.3.1  Implementation  of  PFR-based 
Fabrication 

The  fabrication  of  wafers  with  a  process  repres¬ 
ented  as  a  PFR  involves  several  steps.  A  suitable 
PFR  must  be  created  and  installed  to  create  per¬ 
sistent  flow  objects.  The  tree  structure  of  these 
flow  objects  reflects  the  hierarchical  decomposition 
of  PFR  encoding.  Wafer  lots  must  then  be  created 
and  associated  with  this  PFR.  These  lots  are  then 
"started"  to  create  a  task  data  structure  which  is 
isomorphic  to  the  flow  data  structure.  The  tasks 
point  to  flow  objects,  and  these  tasks  are  used  to 
record  lot  specific  information  because  multiple  lots 
are  usually  fabricated  from  a  single  flow.  At  this 
point,  actual  machine  operations  can  be  scheduled 
and  reservations  made  for  both  machines  and  oper¬ 
ators.  Finally,  the  actual  machine  operations  can 
be  performed,  instructions  given  to  operator  and 
machines,  and  data  collected  from  operator  or 
machine  and  entered  into  the  database. 


2.3.2  Processing  of  All  CMOS  Baseline 
Defect  Array  Lots  via  PFR 

Several  lots  have  been  processed  via  PFR  for  the 
MIT  CMOS  baseline  process.  All  future  fabrication 
runs  of  the  MIT  CMOS  baseline  process  used  to 
process  test  lots  known  as  defect  array  (DA)  lots 
will  be  processed  with  the  aid  of  our  PFR-based 
fabrication.  All  operations  are  driven  by  the  CAFE 
system  using  a  PFR-based  representation  of  the 
process.  Settings  and  operator  instructions  are  dis¬ 
played  (some  are  downloaded  automatically),  and 
specified  readings  and  data  are  collected  via  oper¬ 
ator  entry  or,  in  some  cases,  automatically.  Both 
types  of  information  are  stored  in  the  GESTALT 
database.  The  report  generators  access  the  data¬ 
base  to  retrieve  the  data  and  present  it  in  a  form 
suitable  for  the  user. 


2.3.3  Development  of  PFR-based  Split  Lot 
Processing  Capabilities 

We  have  augmented  the  MIT  PFR  to  include  a 
mechanism  for  user  specified  split  lot  processing. 
Users  can  now  specify  which  wafers  in  a  lot  are  to 
be  processed  by  a  subflow.  With  this  mechanism, 
users  can  specify  different  parameters  or,  indeed, 
completely  different  processing  for  individual  wafers 
or  groups  of  wafers. 


2.3.4  Traveler  and  "Op-set"  Report 
Generators  In  Use 

We  have  completed  projects  to  produce  report  gen¬ 
erators  to  replace  the  traveler  and  "Op-set"  reports 
now  in  use  in  our  Integrated  Circuits  Laboratory 
(ICL)  and  Technology  Research  Laboratory  (TRL). 
The  traveler  reports  provide  a  succinct  summary  of 
the  current  state  of  processing  for  a  lot  augmented 
with  details  concerning  past  and  planned  special 
activities.  These  include  (1)  when  the  lot  was  put 
on  hold,  (2)  reasons  for  putting  the  lot  on  hold,  and 
(3)  what  was  done  to  remove  the  hold  status.  The 
Op-set  reports  are  more  expansive,  including 
details  of  processing,  recipes  used,  operator 
instructions  gleaned  from  PFR,  and  measurements 
and  other  data  that  were  collected  in  accordance 
with  the  PFR  used  for  processing  the  lot. 


2.3.5  Initial  Integration  of  Run-by-Run 
Control  into  CAFE 

Applications  implementing  a  first  integration  of  RbR 
process  control  into  the  MIT  Computer  Integrated 
Manufacturing  (CIM)  system  (CAFE)  have  been 
developed.  Object  oriented  data  structures  have 
been  designed  and  implemented  to  support  the 
setup  of  the  RbR  control  project  and  ongoing 
control  calculations.  The  actual  processing  of 
wafers  is  conducted  via  PFR-based  fabrication 
applications. 

This  initial  control  experiment  uses  a  simple 
process  consisting  of  a  pre-process  measurement, 
an  oxide  etching  step,  and  a  post-process  measure¬ 
ment.  A  Nanospec  is  used  for  both  measurements, 
and  data  sent  via  the  RS  232  interface  for  the 
Nanospec  are  automatically  collected  and  stored  in 
the  database  in  accordance  with  information  speci¬ 
fied  in  a  PFR.  A  LAM  Model  590  plasma  etcher  is 
interfaced  to  a  PC  via  a  SECS  II  RS  232  link.  Etch 
recipes  are  uploaded  and  downloaded  to  a  disk  on 
the  computer  running  CAFE  via  PCNFS.  The  soft- 
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ware  which  dynamically  modifies  the  etch  recipes 
runs  under  the  CAFE  CIM  system. 

The  CAFE  system  was  intended  to  he  used  in  all 
phases  of  process  design,  development,  planning, 
and  manufacturing  of  integrated  circuits.  CAFE 
was  to  provide  a  platform  for  work  in  TCAD, 
process  modeling,  and  manufacturing  quality 
control.  A  requirement  for  the  CAFE  architectural 
framework  was  to  support  a  wide  variety  of  soft¬ 
ware  modules,  including  both  development  tools 
and  on-line  applications.  The  key  components  of 
the  CAFE  architecture  are  the  dat£  model  and  data¬ 
base  schema,  user  interface,  process  flow  and 
wafer  representations,  and  application  programming 
and  database  interfaces. 

Particular  emphasis  was  placed  on  the  use  of  a 
single  PFR  to  drive  both  simulation  and  fabrication 
and  the  actual  use  of  PFR-based  fabrication  in  our 
ICL. 


2.3.6  CAFE  Installed  and  Operational  at 
Case  Western  Reserve  University 

The  CAFE  software  system  has  been  successfully 
installed  at  Case  Western  Reserve  University  in 
Cleveland,  Ohio.  The  hardware  configuration  con¬ 
sists  of  a  Sun  Sparc  2  Workstation  with  48  mega¬ 
bytes  of  memory  and  a  1  gigabyte  disk.  This 
installation  required  purchase  of  an  INGRES 
license.  Installation  was  accomplished  via  network. 
Several  days  of  familiarization  and  system  manager 
training  were  provided  at  MIT  for  Case  personnel. 


2.3.7  PFR-driven  Simulation  Manager 

We  have  developed  a  rather  remarkable  system  for 
the  simulation  of  IC  processing  from  our  PFR. 
Users  can  enter  and/or  edit  their  process  flow 
representation  and  then  run  SUPREM  III  simu¬ 
lations  without  knowing  it  specifically.  A  key  feature 
of  this  software  is  the  "validator"  which  checks  the 
validity  of  previous  simulation  computations  and 
retains  any  valid  computations  after  the  PFR  has 
been  modified.  Thus  computation  time  is  minimized 
while  maintaining  correctness  of  the  final  simu¬ 
lations.  This  software  also  allows  the  user  to  con¬ 
veniently  generate  reports,  such  as  plots  of  impurity 
concentrations  and  calculations  of  sheet  resistance, 
etc. 


2.3.8  Dynamic  Modification  of  PFR-based 
Fabrication  Provided 

Initial  experience  with  PFR-based  fabrication 
resulted  in  user  requests  for  applications  to  enable 
dynamic  modification  of  fabrication  plans.  Users 
wanted  to  be  able  to  skip  planned  operations,  as 
well  as  to  insert  new  operations  (or  sequences  of 
operations)  at  arbitrary  points  in  the  processing 
plan.  In  addition,  users  wanted  to  revise  their  pro¬ 
cessing  plan  (and  its  associated  PFR)  and  fabricate 
according  to  this  new  plan  as  if  it  had  been  in  effect 
from  the  start.  A  restart  lot  mechanism  has  been 
provided  which  permits  a  user  to  initiate  processing 
with  the  best  estimate  of  all  future  processing.  If,  in 
the  course  of  fabrication  processing,  the  user  real¬ 
izes  that  some  modifications  are  required  for  future 
processing  of  this  lot,  then  the  user  can  edit  the 
PFR  and  initiate  the  restart  lot  operation.  The 
restart  lot  function  will  verify  that  the  new  PFR  calls 
for  precisely  the  same  fabrication  operations  as  the 
former  PFR  for  all  processing  that  has  already  been 
accomplished.  This  check  ensures  that  actual, 
completed  processing  history  is  not  changed.  If  it 
is,  then  this  .'estart  lot  operation  fails:  and  the 
program  informs  the  user.  If  the  new  PFR  passes 
this  test,  then  the  former  task  structure  is  deleted 
and  replaced  by  a  new  task  structure  corresponding 
to  the  new  PFR. 


2.3.9  X-based  PFR  Flow  and  Task  Browser 
Enables  Global  View  of  Hierarchical 
Structure 

An  X-based  application  enables  users  to  browse 
through  the  tree  structures  of  either  installed  PFR 
flows  or  the  tasks  associated  with  a  lot  being  pro¬ 
cessed  with  a  PFR-based  flow.  In  addition  to  pro¬ 
viding  the  user  with  a  more  global  perspective  than 
provided  by  individual  machine  operations,  this 
browser  enables  detailed  examination  of  task  or 
flow  attributes  at  any  point  in  the  hierarchy.  It  also 
enables  users  to  easily  specify  tasks  to  be  skipped 
and  places  in  the  task  hierarchy  for  insertion  of 
additional  flows. 


2.3.10  User  Friendly  Hold  Application 
Developed 

A  fabform-based  application  has  been  developed  to 
facilitate  the  creation  of  a  PFR  appropriate  to  the 
specification  of  a  hold  status  for  a  lot.  This  applica¬ 
tion  queries  the  user  and  collects  the  reason  for 
placing  the  lot  on  hold  as  well  as  the  necessary 
conditions  for  removing  the  lot  from  the  hold  status. 
A  new  PFR  is  then  automatically  generated, 
installed,  and  the  corresponding  task  tree  is  gener- 
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ated  and  installed  in  the  task  structure  associated 
with  the  lot.  In  a  subsequent  operation,  an  operator 
will  be  presented  with  this  information  and,  when 
the  hold  status  is  removed,  the  operator  will  enter 
the  appropriate  information  as  specified  in  this  new 
PFR  subflow. 


2.3.11  GESTALT  Extended  to  Support 
Inheritance  and  CLOS 

The  current  version  of  GESTALT  now  supports 
inheritance  and  provides  transparent  access  to  per¬ 
sistent  objects,  which  are  described  and  manipu¬ 
lated  solely  via  CLOS  constructs.  This 
transparency  simplifies  applications  programming, 
as  it  frees  the  programmer  from  the  task  of  trans¬ 
lating  between  programming  language  structures 
and  database  structures,  which  is  required  in  many 
systems.  Applications  programmers  may  utilize  the 
rich  object  modeling,  generic  functions,  interactive 
debuggers,  interpreters,  etc.,  present  in  the  CLOS 
environment  to  further  simplify  programming. 


2.4  Process  Flow  Representatation 

Project  Staff 

Michael  B.  Mcllrath,  Professor  Donald  E.  Troxel 

The  objective  of  this  project  is  the  development  of  a 
Process  Flow  Representation  (PFR)  for  the  inte¬ 
gration  of  technology  (process  and  device)  CAD 
(TCAD)  into  integrated  circuits  semiconductor 
computer-integrated  manufacturing  (CIM).  The 
effort  has  proceeded  on  three  major  fronts:  funda¬ 
mental  modeling,  application  to  process  design  and 
fabrication  at  MIT,  and  cooperative  work  with 
industry  to  define  and  test  a  process  representation 
standard. 

For  high  performance  computing  systems  and  other 
advanced  technology,  concurrence  in  the  design  of 
the  product,  manufacturing  process,  and  factory  is 
crucial.  The  goal  is  to  achieve  fully  integrated 
design  and  manufacture  in  which  the  boundary 
between  design  and  manufacturing  domains  is  elim¬ 
inated.  In  particular,  information  from  the  manufac¬ 
turing  floor  is  continuously  available  from  the 
earliest  stages  of  process  and  device  design 
onward,  while  the  manufacturing  process,  devel¬ 
oped  concurrently  with  the  product,  continues  to 
undergo  design  improvement  and  modification  while 
in  production.  Computer-integrated  design  and 
manufacture  (Cl DM),  therefore,  requires  a  coherent 
manufacturing  process  representation  capable  of 
storing  information  from  a  variety  of  different  know¬ 
ledge  domains  and  disciplines,  and  supporting 
access  to  this  information  in  a  consistent  manner. 


We  believe  that  our  general  semiconductor  process 
modeling  framework  organizes  the  complexity  of 
this  interrelated  information  and  puts  our  PFR  on  a 
sound  footing  by  giving  it  clear  semantics. 

A  high-level  conceptual  model  for  describing  and 
understanding  semiconductor  manufacturing  pro¬ 
cessing  is  a  crucial  element  of  both  the  CIDM 
research  program  and  software  frameworks  for 
TCAD  and  CIM,  including  the  MIT  Computer  Aided 
Fabrication  Environment  (CAFE).  Initially  a  "two- 
stage"  generic  process  step  model  was  used.  This 
described  processing  steps  in  terms  of  two  inde¬ 
pendent  components:  (1)  an  equipment-dependent, 
wafer-independent  stage,  which  maps  equipment 
settings  to  physical  processing  environments,  and 
(2)  an  equipment-independent,  wafer-dependent 
stage,  which  relates  physical  environments  to 
changes  in  the  input  wafers.  Driven  by  the  needs 
of  process  control  and  optimization  research, 
including  sophisticated  modeling,  design,  and 
experimental  model  verification,  our  fundamental 
conceptual  process  model  has  evolved  from  the 
two-stage  generic  process  model  into  one  which  is 
part  of  a  more  general  process  modeling  frame¬ 
work,  in  which  the  earlier  two-stage  model  is  a 
special  case.  Our  approach  to  process  represen¬ 
tation  for  both  TCAD  and  CIM  is  based  on  this 
general  modeling  framework  for  semiconductor  pro¬ 
cessing.  In  this  framework,  state  information  (e.g., 
wafer,  environment,  and  equipment  state),  and 
models,  or  transformations,  that  describe  relation¬ 
ships  between  state  descriptions,  are  formally  iden¬ 
tified  and  described.  The  purpose  of  this 
comprehensive  framework  is  to  enable  an  effective 
representation  that  can  be  used  throughout  the  IC 
semiconductor  process  life-cycle,  from  early  con¬ 
ception  and  design  phases  through  fabrication  and 
maintenance. 

In  the  MIT  CAFE  system,  the  PFR  is  expressed  in 
a  textual  (ASCII)  format  and  then  converted  into 
GESTALT  objects  and  loaded  into  the  CAFE  data¬ 
base.  The  textual  language  of  the  PFR  is 
extensible,  so  that  it  can  flexibly  accommodate 
changes  and  extensions  to  both  the  underlying 
modeling  methodology  and  the  needs  of  specific 
applications.  The  object-oriented  nature  of  the 
GESTALT  database  interface  enables  the  conven¬ 
ient  evolutionary  development  of  CAFE  software 
applications  built  around  the  PFR.  The  PFR  allows 
process  step  descriptions  to  be  "underdetermined," 
for  example,  by  expressing  only  the  wafer-state 
change,  making  it  possible  to  develop  a  process 
incrementally  with  increasing  degrees  of  detail.  In 
addition  to  expressing  the  fundamental  concepts  of 
wafer  transformation  within  individual  process  steps, 
the  PFR  supports  both  hierarchical  and  parameter 
abstraction  and  embedded  computation,  thereby 
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providing  support  for  modular  process  design  and 
development.  Processes  expressed  in  the  PFR  can 
be  simulated  using  a  variety  of  technology  CAD 
tools:  PFR  extensibility  allows  the  incorporation  of 
both  simulator-dependent  and  simulator- 
independent  information.  A  simulation  manager 
application  uses  the  appropriate  information  in  the 
PFR  along  with  knowledge  of  specific  simulators  to 
invoke  simulation  tools  and  maintain  simulation 
state. 

Extensions  to  the  textual  PFR  and  corresponding 
database  schema  support  specification  of  individual 
wafers  or  wafersets  (sublots  or  splits)  within  a  lot 
for  processing,  and  specification  of  engineering 
holds  at  a  particular  step  in  the  process  sequence. 
An  engineering  hold  might  be  planned,  for  example, 
as  part  of  the  fabrication  specification  for  a  process 
which  is  currently  under  experimental  development. 
In  conjunction  with  developments  in  CAFE  that 
support  dynamic  modificaton  of  processes  in  the 
fabrication  line,  the  "hold"  capability  in  the  PFR  can 
also  be  used  in  response  to  unexpected  events  that 
occur  during  fabrication.  With  these  manufacturing 
extensions,  the  PFR  is  used  in  the  CAFE  system  at 
the  MIT  ICL  for  the  fabrication  processes  supported 
by  the  ICL  technical  staff,  with  all  processing  oper¬ 
ations  driven  by  CAFE  from  the  PFR  stored  in  the 
database. 

Work  has  also  proceeded  on  the  definition  of  a 
standard  semiconductor  process  representation 
(SPR)  for  use  in  both  industry  and  research  fabri¬ 
cations,  in  association  with  the  industry  CAD 
Framework  Initiative  (CFI)  Technical  Subcommittee 
on  Technology  CAD  in  defining  broadly  applicable 
TCAD  framework  standards.  Mr.  Mclirath  is  cur¬ 
rently  the  chair  of  the  working  group  on  SPR. 
Based  largely  on  the  conceptual  process  model 
described  above,  a  high-level  information  model  of 
semiconductor  processes  is  being  developed  in 
conjunction  with  Motorola,  IBM,  Texas  Instruments, 
and  other  industry  and  university  representatives. 
A  prototype  SPR  implementation  and  a  program¬ 
ming  interface  have  been  developed  and  employed 
in  an  experimental  TCAD  framework,  which 
included  an  implementation  of  the  semiconductor 
wafer  representation  (SWR),  and  tools  developed 
by  industry  and  other  university  researchers.  An 
industry  survey  of  specific  itemized  requirements  for 
SPR  has  been  completed  and  published  as  part  of 
a  formal  CFI  Request  for  Technology,  which  identi¬ 
fies  mandatory  requirements  and  evaluation  criteria 
for  complete  SPR  proposals  now  under  consider¬ 
ation  by  the  working  group. 


2.5  Technology  CAD  Framework 

Project  Staff 

Michael  B.  Mclirath 

Contemporary  large-scale  software  system  engi¬ 
neering  emphasizes  frameworks,  wherein  common 
structure  and  interface  specifications  enable  both 
current  and  future  software  components  to  be 
integrated  in  a  flexible  and  modular  way.  Software 
may  be  roughly  divided  into  tools,  such  as  a  simu¬ 
lator,  which  perform  some  part  of  an  application 
task,  and  services,  such  as  a  database,  which 
provide  some  necessary  support  capability  used  by 
various  tools.  With  framework  standards,  reusable, 
interchangeable  software  components  from  various 
suppliers  may  be  deployed  in  systems  which 
comply  with  the  standard.  In  the  broad  sense,  a 
framework  standard  specifies: 

•  data  representations  for  the  objects  of  dis¬ 
course  in  the  application  domain  and  their 
semantics,  and  programmatic  interfaces  to 
those  representations,  and 

•  architecture:  that  is,  interactions  among  soft¬ 
ware  components  (tools  and  services),  and 
how  tools  fit  together  to  perform  tasks  for  the 
user. 

Frameworks  have  been  particularly  successful  in 
development  of  electronic  circuit  CAD  systems. 
Standards  for  CAD  frameworks  are  currently  being 
established  by  the  CFI,  a  broad  organization  of 
vendor  and  user  companies  which  has  now 
expanded  its  scope  to  include  technology  (process 
and  device)  CAD.  A  TCAD  framework  standard 
includes  programming  representations  for  the  fun¬ 
damental  objects  of  process  and  device  CAD: 
physical  structures  on  the  wafer,  manufacturing 
process,  and  structure  and  behavior  of  the  resulting 
devices.  A  TCAD  framework  standard  also  speci¬ 
fies  how  application  software  is  structured  to  use 
these  representations  and  the  underlying  software 
services  in  process  and  device  design  and  simu¬ 
lation  activities. 

At  present,  development  of  TCAD  framework  archi¬ 
tectural  standards  is  limited  to  (1)  specifications  of 
implementation  and  interfaces  for  the  wafer  and 
process  representations,  (2)  use  of  domain- 
independent  framework  standards,  such  as  intertool 
communication  (ITC),  user  interface  (Ul),  and  (3) 
extension  language  specifications.  We  are  begin¬ 
ning  to  look  at  higher  level  architectural  issues, 
such  as  the  interrelationships  between  the  frame¬ 
work  data  representations  and  the  connection  of 
compliant  tools  to  achieve  end-user  design  objec¬ 
tives.  We  are  also  investigating  the  larger 
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questions  of  the  relationship  between  frameworks 
for  different  related  domains  (e.g.,  circuit  CAD  and 
TCAD),  and  the  integration  of  design  frameworks 
into  frameworks  for  CIM. 

We  are  currently  active  as  members  of  the  TCAD 
Technical  Subcommittee  (TSC)  of  the  CFI,  contrib¬ 
uting  to  fundamental  modeling,  implementation  (lan¬ 
guage  and  database  technology),  programmatic 
interfaces,  prototyping  development,  and  relation  to 
overall  CIM  and  TCAD  architectural  frameworks. 
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1.1  Introduction 

The  field  of  digital  signal  processing  developed 
because  the  use  of  digital  computers  afforded  more 
flexibility  in  implementing  signal  processing  algo¬ 
rithms  and  systems.  This  field  now  encompasses  a 
variety  of  both  digital  and  analog  technologies, 
spanning  a  wide  range  of  applications,  bandwidths, 
and  realizations.  The  Digital  Signal  Processing 
Group  at  RLE  carries  out  research  on  algorithms 
and  their  applications  in  signal  processing.  Cur¬ 
rently,  we  are  interested  in  applying  these  algo¬ 
rithms  to  signal  enhancement,  processing  of 
undenvater  acoustic  signals,  advanced 
beamforming  for  radar  and  sonar  systems,  and 
signal  coding  and  transmission. 

In  some  of  our  recent  work,  we  have  developed 
new  methods  for  signal  enhancement  and  noise 
cancellation  with  single  or  multisensor  measure¬ 
ments.  We  have  also  been  developing  new 
methods  for  representing  and  analyzing  fractal 
signals.  This  class  of  signals,  which  occurs  in  a 
wide  variety  of  physicai  environments,  has  potential 
application  in  problems  involving  signal  design.  We 
are  also  exploring  potential  applications  of  nonlinear 
dynamics  and  chaos  theory  for  signal  design  and 
analysis. 


In  other  research,  we  are  investigating  applications 
of  signai  and  array  processing  to  ocean  and  struc¬ 
tural  acoustics  and  geophysics.  These  problems 
require  the  combination  of  digital  signal  processing 
tools  with  knowledge  of  wave  propagation  to 
develop  systems  for  short-time  spectral  analysis, 
wavenumber  spectrum  estimation,  source  localiza¬ 
tion,  and  matched  field  processing.  We  emphasize 
the  use  of  real-world  data  from  laboratory  and  field 
experiments  such  as  the  Heard  Island  Experiment 
for  Acoustic  Monitoring  of  Global  Warming  and 
several  Arctic  acoustic  experiments  conducted  on 
the  polar  ice  cap. 

The  DSP  group  is  also  involved  in  research  on 
broadband  and  secure  communication  systems. 
These  activities  include  the  development  of  spread 
spectrum  and  multiple-access  communication 
systems  exploiting  sophisticated  signal  processing 
algorithms,  used  in,  for  example,  personal  wireless 
systems. 

Much  of  our  work  involves  close  collaboration  with 
the  Woods  Hole  Oceanographic  Institution,  MIT 
Lincoln  Laboratory,  and  a  number  of  high  tech¬ 
nology  companies  in  the  Boston  area. 


’  Bolt,  Beranek,  and  Newman,  Inc.,  Cambridge,  Massachusetts. 

2  Department  of  Electrical  Engineering,  Systems  Division,  Faculty  of  Engineering,  Tel-Aviv  University,  Israel;  adjunct  scientist.  Depart¬ 
ment  of  Applied  Ocean  Physics  and  Engineering,  Woods  Hole  Oceanographic  Institution,  Woods  Hole,  Massachusetts. 
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1.2  Oceanographic  Signal  Processing 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  NOO0 1 4-9 1-J- 1628 

Project  Staff 

Professor  Arthur  B.  Baggeroer 

Our  research  programs  in  this  area  involve  the 
application  of  signal  and  array  processing  to  prob¬ 
lems  in  ocean  acoustics  and  geophysics.  These 
problems  require  an  understanding  of  both  signal 
processing  and  wave  propagation;  moreover,  most 
rely  on  data  from  laboratory  or  field  experiments,  so 
an  appreciation  of  real  world  issues  such  as  noise, 
sensor  calibrations  and  modeling  errors  is  needed. 
Several  of  the  topics  provide  the  opportunity  to  par¬ 
ticipate  in  oceanographic  cruises. 


1 .2.1  Acoustic  Thermometry  of  Ocean 
Climate 

The  Heard  Island  Feasibility  Test  demonstrated  that 
coded  acoustic  signals  can  be  transmitted  over 
10,000  km  ranges.  This  test  led  to  the  Acoustic 
Thermometry  of  Ocean  Climate  (ATOC)  program, 
which  is  funded  by  the  Defense  Advanced  Projects 
Agency  (DARPA).  A  network  of  acoustic  sources 
and  receivers  will  be  deployed  in  the  Pacific, 
Atlantic,  and  Indian  Oceans  to  monitor  them  by 
measuring  changes  in  acoustic  travel  times.  These 
changes  will  be  used  to  infer  temperature  changes. 


1.2.2  Signal  Processing  for  Large  Aperture 
Arrays 

There  is  extensive  literature  on  the  subject  of  adap¬ 
tive  processing  for  arrays  with  small  numbers  of 
sensors.  However,  when  an  array  has  a  large 
number  of  sensors,  some  problems  can  result.  The 
main  problem  is  that  the  statistical  characterization 
of  the  ambient  field  requires  an  amount  of  data 
which  exceeds  the  short-term  stationarity  of  the 
field.  Introducing  adaption  by  adding  sensors  can 
lead  to  lower  quality  performance  because  addi¬ 
tional  degrees  of  freedom  are  not  well  character¬ 
ized.  Our  research  develops  adaptive  array 
processing  algorithms  which  incorporate  the  statis¬ 
tical  characterization  of  the  ambient  field  with  a 
large  number  of  sensors. 


1.2.3  Matched  Field  Processing 
Project  Staff 

Kathleen  E.  Wage,  Professor  Arthur  B.  Baggeroer 

Matched  field  processing  is  used  in  underwater 
acoustics  for  processing  data  from  large  arrays 
when  the  inhomogeneity  of  the  signal  field  is  impor¬ 
tant.  It  is  particularly  applicable  to  vertical  arrays 
used  for  low  frequency,  long-range  propagation. 
Two  major  applications  of  matched  field  methods 
are  to  determine:  (1)  the  location  (range,  depth  and 
bearing)  of  an  acoustic  source;  and  (2)  the  sound 
speed  of  the  acoustic  channel  when  the  location  of 
the  source  and  receiver  (tomography)  are  known. 
Matched  field  processing  can  improve  the  resolution 
performance  of  an  array  by  a  factor  of  100  by 
exploiting  coherence  among  the  multipath,  or 
modes,  in  acoustic  propagation.  Some  of  our 
important  research  topics  include  (1)  developing 
robust  methods  which  are  tolerant  of  environmental 
uncertainties  and  (2)  determining  "optimum  array 
geometries"  and  adaptive  processing  which  incor¬ 
porates  the  mesoscale  dynamics  of  the  ocean. 


1.3  Segmentation  of  Polarimetric 
Synthetic  Aperture  Radar  Images  Using 
Markov  Random  Field  Techniques 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
MIT  -  Woods  Hole  Oceanographic  Institution 
Joint  Program 

Project  Staff 

Saurav  Dev  Bhatta,  Dr.  Jacques  Verly,  Professor 
Alan  V.  Oppenheim 

Synthetic  Aperture  Radar  (SAR)  images  constitute 
one  major  class  of  speckled  images.  Speckle 
obscures  boundries,  so  the  goal  of  segmentation  is 
to  recover  obscured  regions  from  the  speckled 
images.  Derin  and  Kelly^  and  their  collegues  at  the 
University  of  Massachusetts  at  Amherst  have  devel¬ 
oped  and  tested  a  technique,  which  uses  Markov 
Random  Field  models  to  segment  speckled  images. 
We  are  developing  a  method  for  segmenting  Polar¬ 
imetric  SAR  (POL-SAR)  images,  which  is  based  on 


3  H.  Derin,  P.A.  Kelly,  G.  Vezina,  and  S.G.  Labitt,  "Modelling  and  Segmentation  of  Speckled  Images  Using  Complex  Data,"  IEEE 
Trans.  Geosci.  Remote  Sens.  (28  (1):  76-87  (1990). 
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the  technique  developed  at  University  of  Massachu¬ 
setts. 

The  image  model  used  here  consists  of  a  two-level 
hierarchical  process  that  makes  use  of  the  fact  that 
speckle  exhibits  spatial  correlation.  The  first  level, 
called  the  Region  Process,  is  characterized  by  a 
Markov  Random  Field  (MRF)  that  groups  the  pixels 
of  the  image  into  regions.  The  second  level,  the 
Speckle  Process,  is  characterized  by  another 
random  field  (not  necessarily  MRF)  peculiar  to  the 
region  under  consideration.  The  two  types  of  pro¬ 
cesses  are  independent  of  each  other  and  between 
themselves.  Only  the  Speckle  Process  is  obsen/- 
able.  The  precise  details  of  the  Speckle  Process 
depend  upon  the  kind  of  imagery  under  consider¬ 
ation:  amplitude  (complex)  versus  intensity  (real) 
and  single-look  versus  multilook. 

The  algorithm  is  based  on  a  Maximum  a  Posteriori 
(MAP)  formulation  to  estimate  segmentation  (which 
is  the  underlying  Region  Process),  given  the  obser¬ 
vation  data  (which  is  directly  related  to  the  Speckle 
Process).  In  particular,  the  Simulated  Annealing 
(SA)  procedure  is  used  to  obtain  the  MAP  estimate 
of  the  regions  of  the  image.  This  algorithm  pro¬ 
ceeds  in  a  series  of  iterations,  each  consisting  of  an 
estimation  of  the  necessary  region  and  speckle 
parameters  followed  by  segmentation  based  on 
these  estimated  parameters.  The  algorithm  is  adap¬ 
tive,  estimating  necessary  parameters  automat¬ 
ically,  when  given  the  number  of  regions  in  the 
image. 

1.4  Signal  Synthesis  from  Marine 
Mammal  Communications 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
MIT  -  Woods  Hole  Oceanographic  Institution 
Joint  Program 

Project  Staff 

John  R.  Buck,  Professor  Alan  V.  Oppenheim,  Dr. 
Josko  Catipovic 

Marine  mammals  such  as  whales  and  dolphins 
utilize  a  wide  variety  of  sounds  for  unden^rater 
acoustic  communication  while  engaging  in  a  variety 
of  complex  social  behaviors.  It  is  appealing  to 
speculate  that  millenia  of  evolution  have  brought 
these  animals  to  the  point  where  their  signals  are  in 


some  sense  optimum  for  the  underwater  environ¬ 
ment.  By  studying  their  signals,  we  hope  to  dis¬ 
cover  which  features  make  the  signals  well-suited 
for  the  undenwater  acoustic  environment  and  under 
which  criteria,  if  any,  the  signals  are  optimal.  The 
knowledge  we  gain  by  analyzing  the  biological 
signals  should  indicate  an  approach  for  synthe¬ 
sizing  new  optimal  signals  for  underwater  communi¬ 
cations. 


1.5  Analysis,  Synthesis,  and 
Applications  of  Synchronized  Chaotic 
Systems 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
Lockheed  Sanders,  Inc./U.S.  Navy 
Office  of  Naval  Research 
Contract  N00014-91-C-0125 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Kevin  M.  Cuomo,  Professor  Alan  V.  Oppenheim 

Chaotic  systems  provide  a  rich  mechanism  for 
signal  design  and  generation  for  communications 
and  a  variety  of  signal  processing  applications. 
Because  chaotic  signals  are  typically  broadband, 
noiselike,  and  difficult  to  predict,  they  can  be  uti¬ 
lized  in  various  contexts  for  masking  information¬ 
bearing  waveforms  and  as  modulating  waveforms  in 
spread  spectrum  systems.  It  is  often  desirable  to 
utilize  high-dimensional  chaotic  systems  with  speci¬ 
fied  characteristics.  However,  there  is  a  practical 
difficulty  because  the  analysis  and  synthesis  of 
chaotic  systems  is  not  well  understood  due  to  their 
highly  nonlinear  nature.  A  significant  challenge  and 
opportunity  exists  for  exploring  and  developing  new 
methods  for  chaotic  signal  generation  and 
detection. 

One  objective  of  this  research  is  to  develop  theore¬ 
tical  and  numerical  techniques  for  synthesizing  a 
class  of  high-dimensional  chaotic  systems.  Another 
objective  is  to  develop  methods  for  embedding  an 
information-bearing  waveform  in  a  chaotic  carrier 
signal  and  then  recovering  the  information  at  the 
receiver.  We  show  that  synchronized  chaotic 
systems  (SCSs)  are  well-suited  to  this  problem 
because  SCSs  possess  a  self-synchronization  prop- 
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erty/  We  also  show  that  choatic  signal  masking 
allows  analog  signals  to  be  privately  transmitted 
and  recovered  at  the  intended  receivers.®  In  Cuomo 
and  Oppenheim,®  we  show  that  digital  bit  streams 
can  be  privately  communicated  by  modulating  a 
transmitter  coefficient  with  the  information  waveform 
and  detecting  the  modulation  with  an  SCS  at  the 
receiver. 


1.6  High-Resolution  Direction  Finding 
for  Multidimensional  Scenarios 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N0001 4-91 -J-1 628 

Project  Staff 

Jacek  Jachner,  Professor  Alan  V.  Oppenheim,  Dr. 
Harry  B.  Lee 

Recently,  there  has  been  considerable  interest  in 
applying  high-resolution  techniques  for  direction 
finding  (DF)  and  time  series  analysis.  Results  from 
the  research  of  Lee  and  Wengrovitz,^  have 
improved  our  understanding  of  high-resolution 
direction  finding  techniques  in  the  following  areas: 

•  Beamformer  design  for  beamspace 
approaches; 

•  Analytical  expressions  for  the  threshold  SNR  at 
which  algorithms  can  resolve  closely-spaced 
sources: 

•  Cramer-Rao  lower  bounds  on  the  variances  of 
unbiased  estimators  of  direction; 

•  Covariance  matrix  eigenstructure  for  closely- 
spaced  sources. 

The  results  are  applicable  to  far-field  planar  sce¬ 
narios  in  which  the  location  of  each  source  is  speci¬ 
fied  by  a  single  angular  parameter. 


Many  practical  applications  of  DF  techniques  are 
multidimensional  in  nature,  requiring  estimation  of  a 
vector  of  parameters.  For  example,  two  angular 
parameters  are  necessary  in  3-D  far-field  problems 
(azimuth,  elevation).  Three  parameters  may  be 
required  in  some  3-D  emitter  localization  problems 
(range,  azimuth,  elevation).  Extension  of  1-D 
approaches  to  multi-D  is  not  always  direct,  as 
several  high-resolution  techniques,  including 
MinNorm  and  minimum  dimension  Beamspace 
algorithms,  fail  to  uniquely  locate  sources  for  multi¬ 
dimensional  scenarios.  This  research  explores  the 
multidimensional  direction  finding  problem  and 
extends  recent  1-D  results  to  multi-D  scenarios. 


1.7  Signal  Processing  Applications  of 
Chaotic  Dynamical  Systems 

Sponsors 

AT&T  Laboratories  Doctoral  Support  Program 
Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
Lockheed  Sanders,  Inc./U.S.  Navy 
Office  of  Naval  Research 
Contract  N00014-91-C-0125 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Steven  H.  Isabelle,  Professor  Alan  V.  Oppenheim 

Researchers  in  areas  ranging  from  animal  behavior 
and  medicine  to  economics  and  geophysics  have 
found  evidence  of  chaotic  behavior  in  an  enormous 
number  of  empirically  gathered  time  series.  Indeed, 
the  sheer  volume  of  apparently  random  phenomena 
which  appear  to  have  a  deterministic  explanation 
underscores  the  need  for  signal  processing  tech¬ 
niques  specifically  tailored  to  the  unique 
charateristics  of  chaotic  signals.  In  particL’''’.r, 
because  chaotic  signals  can  generally  be  observed 


*  L.M.  Pecora  and  T.L.  Carroll,  "Synchronization  in  Chaotic  Systems,"  Phys.  Rev.  Lett.  64(8):  821-824  (1990);  T.L.  Carroll  and  L.M. 
Pecora,  "Synchronizing  Chaotic  Circuits,"  IEEE  Trans.  Circuits  Syst.  38(4):  453-456  (1991). 

5  K.M.  Cuomo,  A.V.  Oppenheim,  and  S.H.  Isabelle,  Spread  Spectrum  Modulation  and  Signal  Masking  Using  Synchronized  Chaotic 
Systems,  RLE  TR  No.  570  (Cambridge:  MIT  Research  Laboratory  of  Electronics,  1992);  A.V.  Oppenheim,  G.W.  Wornell,  S.H. 
Isabelle,  and  K.M.  Cuomo,  "Signal  Processing  in  the  Context  of  Chaotic  Signals,"  Proc.  IEEE  ICASSP-92,  March  1992. 

6  K.M.  Cuomo  and  A.V.  Oppenheim,  Synchronized  Chaotic  Circuits  and  Systems  lor  Communications,  RLE  TR  No.  575.  (Cambridge: 
MIT  Research  Laboratory  of  Electronics,  1992). 

r  H.B.  Lee  and  M.S.  Wengrovitz,  "Resolution  Threshold  of  Beamspace  MUSIC  for  Two  Closely  Spaced  Emitters,"  IEEE  Trans.  ASSP, 
38(9):  1545-1559  (1990);  H  B.  Lee  and  M.S.Wengrovitz,  "Beamformer  Preprocessing  for  Enhanced  Resolution  by  the  MUSIC  Algo¬ 
rithm,"  IEEE  Trans.  ASSP.  forthcoming;  H.B.  Lee,  "The  Cramer-Rao  Bound  on  Frequency  Estimates  of  Signals  Closely  Spaced  in 
Frequency,"  IEEE  Trans.  ASSP,  forthcoming. 
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only  indirectly,  e.g.,  through  some  propagation 
channel  or  nonideal  laboratory  instrumentation,  a 
signal's  chaotic  structure  may  be  partially  obscured 
by  additive  noise  and  convolutional  distortion.  Con¬ 
sequently,  algorithms  for  reducing  this  distortion  are 
an  important  component  of  signal  processing 
systems  for  chaotic  signals.  This  research  explores 
the  effect  of  convolutional  distortion  on  chaotic 
signals  along  with  techniques  for  reducing  such  dis¬ 
tortions. 

In  general,  the  limiting  trajectory  of  a  chaotic 
system  will  be  a  highly  structured  set  in  the  state 
space,  while  the  scalar  output  will  appear  erratic 
and  unstructured.  It  is  this  "hidden"  structure  that 
makes  the  signal  interesting  and  allows  for  a  simple 
description.  One  measure  of  structure  which  has 
been  used  to  characterize  a  chaotic  signal  is  the 
fractal  dimension  of  its  strange  attractor.  We  are 
examining  the  effect  of  convolution  on  fractal 
dimension  and  using  these  results  to  develop 
deconvolution  algorithms.  The  major  challenge  here 
is  developing  optimal  computationally  efficient  tech¬ 
niques  which  are  uniformly  applicable  to  a  broad 
class  of  chaotic  signals. 


1 .8  Wavelet-Based  Representation  and 
Algorithms  for  Generalized  Fractal 
Signals 

Sponsors 

AT&T  Laboratories  Doctoral  Support  Program 
Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
Lockheed  Sanders,  Inc./U.S.  Navy 
Office  of  Naval  Research 
Contract  N00014-91-C-0125 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Warren  M.  Lam,  Professor  Gregory  W.  Wornell 

While  the  1/f  family  of  fractal  processes  have 
become  increasingly  appealing  for  modeling  statis¬ 
tically  scale-invariant  phenomena,  we  study  a  gen¬ 
eralization  of  this  signal  model  to  account  for  more 
general  scaling  behavior  found  in  a  wide  variety  of 


natural  phenomena.  Many  phenomena  exhibit 
scaling  behavior,  but  a  great  number  of  them  do 
this  only  over  a  finite  range  of  scales.  For  example, 
while  seafloor  morphology  is  observed  to  be  self¬ 
similar  at  fine  scales,  such  scaling  behavior  is 
absent  in  long-length  scales  due  to  the  lack  of  cor¬ 
relation  among  points  far  apart.®  On  the  other  hand, 
many  phenomena  exhibit  scaling  behavior  which 
varies  over  scales.  For  instance,  such  varying 
scaling  behavior  is  encountered  in  the  study  of 
diluted  gels  and  colloidal  aggregates  in  the  field  of 
materials  science.® 

In  this  work,  we  focus  on  the  development  of  a 
class  of  generalized  fractal  processes  for  capturing 
such  nonuniform  scaling  behavior.  Exploiting  the 
role  of  the  wavelet  transformation  as  a  whitening 
filter  for  these  kinds  of  processes,  we  formulate 
algorithms  for  addressing  a  number  of  practical  esti¬ 
mation  problems  involving  these  signals.  Adopting 
a  maximum-likelihood  criterion  and  invoking  an 
estimate-maximize  algorithm,  we  derive  consistent, 
computationally  efficient  spectral  parameter  estima¬ 
tors  which  are  useful  for  the  classification  of  gener¬ 
alized  fractals.  We  also  formulate  a  Bayesian 
minimum  mean-squares  error  signal  estimation 
scheme  which  is  directly  applicable  in  a  variety  of 
signal  separation  and  signal  recovery  scenarios. 


1.9  Acoustic  Tomography  through 
Matched  Field  Processing 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
MIT  -  Woods  Hole  Oceanographic  Institution 
Joint  Program 

Project  Staff 

James  M.  Njeru,  Dr.  James  C.  Preisig,  Dr.  James 
F.  Lynch 

Ocean  Acoustic  Tomography  is  a  technique  in 
which  sound  is  transmitted  through  the  sea  and, 
based  on  the  received  signal,  inferences  about 
properties  of  the  ocean  are  made.  Traditional  ocean 
acoustic  tomography  has  relied  on  travel  time 
measurements  of  the  signal  to  infer  properties  such 
as  sound  speed  profiles  and  currents. 


8  J.A.  Goft  and  T.H.  Jordan,  "Stochastic  Modeling  of  Seafloor  Morphology.  Inversion  of  Sea  Beam  Data  for  Second-Order  Statistics," 
J.  Geophys.  Res.  93  (B11):  13589-13608  (1988). 

9  M.  Fleischmann,  D.J.  Tildesley,  R.C.  Ball,  eds..  Fractals  in  the  Natural  Sciences  (Princeton,  New  Jersey:  Princeton  University  Press, 
1989),  pp.  35-53. 


259 


Chapter  1 .  Digital  Signal  Processing  Research  Program 


In  this  work,  we  explore  the  viability  of  exploiting 
more  than  travel  time  information  embedded  in  the 
received  signal  to  make  tomographic  inferences. 
Using  efficient  signal  processing  algorithms,  we  will 
initially  focus  on  frontal  location  (a  front  is  a  region 
where  an  ocean  sound  speed  profile  suddenly 
changes).  Data  acquired  during  the  Barents  Sea 
Experiment  of  1992  will  be  used  in  this  investi¬ 
gation. 


1.10  Binary  Data  Transmission  and 
Detection  with  Chaotic  Signals 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Haraiabos  C.  Papadopoulos,  Professor  Gregory  W. 
Wornell 

Chaotic  waveforms  provide  a  rich  family  of  signals 
whose  behavior  and  properties  are  currently  being 
explored  in  various  signal  possessing  contexts. 
Classical  signal  processing  corresponds  to  pro¬ 
cessing  signals  that  are  either  deterministic  and 
predictable  or  stochastic.  Chaotic  signals  provide  a 
qualitatively  different  workspace.  Signals  generated 
from  chaotic  systems  are  not  predictable  even 
though  they  are  completely  deterministic.  These 
signals  are  very  sensitive  to  infinitesimal  changes  in 
initial  conditions,  so  that  long-term  predictability  is 
impossible.  Chaotic  waveforms  provide  excellent 
candidates  for  use  in  a  variety  of  contexts  in  signal 
processing  due  to  their  resemblance  to  random 
signals  and  their  ease  of  implementation. 

In  this  project,  we  are  considering  a  very  common 
problem  in  classical  signal  processing:  the  trans¬ 
mission  of  a  binary  data  stream  via  waveform 
representation.  Chaotic  signals  are  introduced  in 
coding  the  binary  stream.  Chaotic  waveforms  may 
prove  very  powerful  in  this  context  due  to  their  ease 
of  implementation,  noiselike  appearance,  and 
broadband  spectra.  The  signals  used  in  such  com¬ 
munication  schemes  have  distinct  dynamical 
behavior  and  characteristics.  Given  a  sequence  of 
noisy  obsen/ations,  we  may  readily  set  up  a  binary 
hypothesis  test  in  order  to  detect  the  original  binary 
stream. 


A  previously  implemented  preliminary  heuristic 
detection  algorithm  shows  that  signal  detection  with 
sufficiently  high  SNR  may  be  accomplished  if  suffi¬ 
ciently  long-sequence  representations  are  used. 
More  sophisticated  algorithms  are  explored  which 
take  into  account  properties  of  the  distinct  dynam¬ 
ical  behavior  of  the  chaotic  signals  that  are  used  to 
code  the  binary  stream. 

1.11  Real-Time  Active  Noise 
Cancellation 

Sponsor 

National  Science  Foundation  Fellowship 

Project  Staff 

Brian  M.  Perreault,  Professor  Alan  V.  Oppenheim 

In  many  industrial  and  consumer  settings,  undesir¬ 
able  acoustic  noise  often  exists.  In  many  cases, 
the  noise  is  obtrusive  enough  to  warrant  an  effort  to 
reduce  its  effect.  In  situations  where  the  noise 
cannot  be  reduced  using  physical  passive  methods, 
active  noise  cancellation  (AND)  techniques  are  a 
suitable  alternative.  Many  types  of  noise  have 
certain  predictable  characteristics;  active  noise  can¬ 
cellation  exploits  these  characteristics  and  uses 
destructive  interference  from  a  secondary  acoustic 
source  to  eradicate  the  noise. 

Many  predictive  methods  used  in  active  noise  can¬ 
cellation  are  computationally  intensive;  to  employ 
them  in  a  practical  manner,  real-time  hybrid  tech¬ 
niques  must  be  developed.  Intelligent  tradeoffs 
must  be  made  between  an  a'gorithm’s  computa¬ 
tional  efficiency  and  its  predictional  effectiveness. 

AND  techniques  would  be  highly  effective  in  a 
plane  or  helicopter  cockpit.  Such  a  setting  has  veiy 
high  noise  levels,  and  the  noise  is,  to  a  certain 
degree,  predictable.  A  set  of  headphones,  utilizing 
a  microphone  and  speaker  in  each  ear  cup,  is 
being  developed  to  cancel  the  cockpit  noise.  This  is 
desirable  because  passive  attenuation  provided  by 
headphones  without  ANC  does  not  compensate  for 
the  noise. 

Digital  predictive  techniques  are  being  used  in  con¬ 
junction  with  analog  to  digital  (A/D)  and  digital  to 
analog  (D/A)  converters  to  implement  active  noise 
cancellation  techniques.  A  digital  signal  processor, 
which  has  high  computational  ability,  was  employed 
to  execute  the  algorithms.  A  flexible  set  of  hard¬ 
ware  has  been  developed  to  evaluate  different  ANC 
algorithms  in  real  time. 
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1.12  State  and  Parameter  Estimation 
with  Chaotic  Systems 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Michael  D.  Richard,  Professor  Alan  V.  Oppenheim 

Chaotic  systems  are  nonlinear  dynamical  systems 
characterized  by  extreme  sensitivity  to  initial  condi¬ 
tions.  A  signal  generated  by  a  chaotic  system  may 
appear  random,  despite  its  having  been  generated 
by  a  low-order,  deterministic  dynamical  system. 
Both  random  and  chaotic  signals  lack  long-term 
predictability:  but,  in  contrast  to  truly  random 
signals,  chaotic  signals  exhibit  short-term  predict¬ 
ability.  Evidence  of  chaotic  behavior  has  been 
reported  in  many  diverse  disciplines  including 
physics,  biology,  engineering,  and  economics. 

We  are  exploring  techniques  for  state  and  param¬ 
eter  estimation  with  chaotic  systems.  We  have 
implemented  the  extended  Kalman  filter,'°  a  recur¬ 
sive  state  estimator  for  nonlinear  systems,  and 
several  related  algorithms,"  and  have  evaluated 
their  effectiveness  as  s*ate  estimators  for  chaotic 
systems.  Preliminary  results  have  shown  these 
algorithms  to  perform  reasonably  well.  But  these 
results  have  also  shown  that  these  algorithms  suffer 
potentially  unacceptable  deficiencies  when  applied 
to  chaotic  systems. 

More  recently,  we  have  developed  and  begun 
testing  several  related,  novel,  state-estimation  tech¬ 
niques  loosely  motivated  by  maximum  likelihood 
state  estimation. The  techniques  exploit  a  distin¬ 
guishing  property  of  all  chaotic  systems — the  simul¬ 
taneous  existence  of  stable  and  unstable 
manifolds. The  combination  of  these  techniques 


with  an  Estimate-Maximize  (EM)  algorithm  is  also 
being  considered.  Finally,  we  plan  to  ascertain  the 
value  of  various  state-estimation  techniques  in 
improving  the  short-term  predictability  of  chaotic 
signals. 

1.13  Nonlinear  Models  for  Signal 
Processing 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
Lockheed  Sanders,lnc./  U.S.  Navy  -  Office  of 
Naval  Research 
Contract  N00014-91-C-0125 

Project  Staff 

Andrew  C.  Singer,  Professor  Alan  V.  Oppenheim 

This  research  involves  the  use  of  nonlinear  system 
models  in  signal  processing.  Linear  models  and 
algorithms  have  been  predominantly  employed  in 
this  field  due  to  the  tractability  of  their  analysis  and 
richness  of  the  class  of  signals  for  which  these 
models  are  well-suited.  However,  by  trading  com¬ 
plexity  for  tractability,  we  can  further  broaden  the 
boundaries  of  signal  processing.  For  example, 
recently  we  have  shown  that  nonlinear  signal  mod¬ 
eling  techniques  are  both  useful  and  practical  for 
modeling  a  variety  of  signals  for  which  linear  tech¬ 
niques  have  proven  inadequate.  Furthermore,  we 
have  begun  to  find  practical  applications  for  non¬ 
linear  dynamic  systems  in  chaos.  Until  recently, 
these  applications  were  considered  to  be  more  par¬ 
adoxical  than  practical.  Another  interesting  phe¬ 
nomenon,  whose  curious  behavior  has  crossed 
many  disciplines  of  science,  is  the  theory  of 
so^^tons.  It  is  our  goal  to  exploit  the  behavior  of 
these  solitary  waves  and  other  nonlinear  phe¬ 
nomena  in  search  of  new  paradigms  and  new  direc¬ 
tions  in  signal  proce".-  j. 


’0  A.  Jazwinski  S'  ichastic  Processes  and  Filtering  Theory  (New  York:  Academic  Press.  1970):  M.D.  Richard.  State  Estimation  with 
Discrete-Time  Chaotic  Systems  using  the  Extended  Kalman  Filter,  unpublished  technical  report,  September  1991. 

’  ’  A.  Willsky,  "Course  Notes  for  6.433  Recursive  Estimation,"  unpublished,  1 989. 

'2  M.D,  Richard.  State  Estimation  with  Discrete-Time  Chaotic  Systems  using  the  Extended  Kalman  Filter,  unpublished  technical  report, 
September  1991. 

'3  J.  Eckmann  and  D.  Ruelle,  "Eigodic  Theory  of  Chaos  and  Strange  Attractors,"  Rev.  Mod.  Phys.  57(3)  Part  1. 617-656  (1985). 
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1.14  Model-Based  Analysis  of  Music 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 

National  Science  Foundation  Graduate  Fellowship 

Project  Staff 

Stephen  F.  Scherock,  Dr.  Bernard  Gold 

Many  digital  audio  signals  can  be  broadly  classified 
as  speech  or  music.  Speech  signals  have  been 
studed  extensively  since  the  1950s  for  automatic 
speech  recognition,  production,  and  data  com¬ 
pression.  Work  has  advanced  toward  each  of 
these  goals  in  part  because  of  the  development  of 
models  for  speech  production  and  recognition. 
While  physically  based  models  exist  for  music  pro¬ 
duction,  applying  these  models  for  synthetic  music 
production  or  data  compression  has  not  been  fully 
exploited.  My  research  involves  model-based  com¬ 
pression  of  music  from  a  single  instrument,  the 
trumpet. 


1.15  Active  Noise  Cancellation 

Sponsors 

Defense  Advanced  Research  Projects  Agency/ 
U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-89-J-1489 
U.S.  Air  Force  ■  Office  of  Scientific  Research 
Grant  AFOSR-91-0034 

Project  Staff 

Kambiz  C.  Zangi,  Professor  Alan  V.  Oppenheim 


Unwanted  acoustic  noise  is  a  by-product  of  many 
industrial  processes  and  systems.  With  active  noise 
cancellation  (ANC),  one  introduces  secondary  noise 
sources  to  generate  an  acoustic  field  that  interferes 
destructively  with  the  unwanted  noise,  thereby  elim¬ 
inating  it.  Examples  of  such  unwanted  noise 
include  machinery  noise,  aircraft  cabin  noise,  and 
fan  noise. 

Traditional  active  noise  cancellation  systems 
assume  a  priori  knowledge  of  the  transfer  function 
from  the  cancelling  speaker  to  the  point  where 
noise  cancellation  is  desired.  Furthermore,  almost 
all  existing  systems  use  two  microphones,  and,  as 
a  result,  suffer  from  acoustic  feedback  between  the 
cancelling  speaker  and  input  microphone. 

We  have  developed  an  adaptive  active  noise  can¬ 
cellation  system  which  uses  one  microphone  only 
and  therefore  has  no  feedback  problem.  This 
system  uses  the  estimate  maximize  (EM)  algorithm 
to  simultaneously  estimate  the  noise  statistic  and 
transfer  function  between  the  cancelling  speaker 
and  the  microphone.  An  optimal  cancelling  signal  is 
then  generated  based  on  these  estimates.'® 

We  have  also  developed  a  two-microphone  version 
of  the  system  mentioned  above  which  does  not 
suffer  from  a  feedback  problem  and  is  more  intelli¬ 
gent  than  the  existing  algorithms  in  using  the 
outputs  of  the  two  microphones.'® 

We  are  currently  studying  the  problem  of  noise  can¬ 
cellation  in  a  volume.  A  topic  of  fundamental 
interest  is  to  find  analytically  simple  ways  to 
describe  the  sound  field  over  a  volume  from 
measurements  made  at  a  finite  set  of  points  in  that 
volume.  Similarly,  we  would  like  to  find  ways  to 
alter  the  sound  field  in  a  desirable  manner  using 
only  a  finite  number  of  sources. 


L.J.  Eriksson,  M.C.  Allie.  and  C  D.  Bremigan,  "Active  Noise  Contro'  Using  Adaptive  Digital  Signal  Processing."  Proc.  ICASSP,  New 
York,  1988,  pp.  2594-2597. 

'5  E.  Weinstein,  A  Oppenheim,  and  M.  Feder,  Signal  Enhancement  Using  Single  and  Multi-Sensor  Measurements.  RLE  TR  No.  560 
(Cambridge:  MIT  Research  Laboratory  of  Electronics,  1990);  A.  Oppenheim,  E,  Weinstein,  K.  Zangi,  M.  Feder,  and  D.  Gauger, 
"Single  Sensor  Active  Noise  Cancellation  Based  on  the  EM  Algorithm,"  submitted  to  IEEE  Trans.  Signal  Proc. 

'6  K.  Zangi,  "A  New  Two  Sensor  Active  Noise  Cancellation  Algorithm,"  Proc.  ICASSP,  Minneapolis,  Minnesota,  1993. 


262  RLE  Progress  Report  Number  135 


Chapter  2.  Advanced  Television  and  Signal  Processing  Program 


Chapter  2.  Advanced  Television  and  Signal  Processing 
Program 

Academic  and  Research  Staff 

Professor  Jae  S.  Urn,  Dr.  David  Forney 

Graduate  Students 

John  G.  Apostolopoulos,  Shiufun  Cheung,  Ibrahim  A.  Hajjahmad,  John  C.  Hardwick,  Kyle  K.  Iwai,  Eddie  F. 
Lee,  Peter  A.  Monta,  Aradhana  Narula,  Julien  J.  Nicolas,  Lon  E.  Sunshine,  Chang  Dong  Yoo 

Technical  and  Support  Staff 

Debra  L.  Marring,  Cindy  LeBlanc,  Giampiero  Sciutto 


2.1  Introduction 

The  present  television  system  was  designed  nearly 
40  years  ago.  Since  then,  there  have  been  signif¬ 
icant  developments  in  technology  which  are  highly 
relevant  to  television  industries.  For  example, 
advances  in  very  large  scale  integration  (VLSI) 
technology  and  signal  processing  theories  make  it 
economically  feasible  to  incorporate  frame-store 
memory  and  sophisticated  signal  processing  capa¬ 
bilities  in  a  television  receiver.  To  exploit  new  tech¬ 
nology  in  developing  future  television  systems, 
Japan  and  Europe  have  established  large  research 
laboratories  which  are  funded  by  the  government  or 
industry-wide  consortia.  Because  the  lack  of  a 
research  laboratory  in  the  United  States  was  con¬ 
sidered  detrimental  to  the  broadcasting  and  equip¬ 
ment  manufacturing  industries,  a  consortium  of 
American  companies  established  the  Advanced 
Television  Research  Program  (ATRP)  at  MIT  in 
1983. 

Currently,  the  consortium  members  include  ABC, 
Ampex,  General  Instrument,  Kodak,  Motorola,  PBS, 
and  Tektronix.  The  major  objectives  of  the  ATRP 
are: 

1.  To  develop  the  theoretical  and  empirical  basis 
for  improving  existing  television  systems,  as 
well  as  for  designing  future  television  systems; 

2.  To  educate  MIT  students  through  television- 
related  research  and  development  while  moti¬ 
vating  them  toward  careers  in  television-related 
industries: 

3.  To  facilitate  continuing  education  of  scientists 
and  engineers  already  working  in  the  industry; 

4.  To  establish  a  resource  center  where  problems 
and  proposals  can  be  discussed  and  studied  in 
detail; 


5.  To  transfer  technology  developed  from  this 
program  to  the  industries. 

The  research  areas  of  the  program  include  (1) 
design  of  a  channel-compatible  advanced  television 
(ATV)  system,  (2)  design  of  a  receiver-compatible 
ATV  system  and  digital  ATV  system,  and  (3) 
development  of  transcoding  methods.  Significant 
advances  have  already  been  made  in  some  of 
these  research  areas.  The  digital  ATV  system  we 
designed  was  tested  in  1992  by  the  Federal  Com¬ 
munications  Commission  (FCC)  for  its  possible 
adoption  as  the  U.S.  HDTV  standard  for  terrestrial 
broadcasting.  No  decision  has  been  made  yet  on 
the  HDTV  standard. 

In  addition  to  research  on  advanced  television 
systems,  our  program  also  includes  research  on 
speech  processing.  Current  research  topics  include 
development  of  a  new  speech  model  and  algo¬ 
rithms  to  enhance  speech  degraded  by  background 
noise. 


2.2  Advanced  Television  Research 
Program 

2.2.1  ATRP  Facilities 

The  ATRP  facilities  are  currently  based  on  a 
network  of  eight  Sun-4  and  three  DecStation  5000 
workstations.  There  is  approximately  14.0  GB  of 
disk  space,  distributed  among  the  various 
machines.  Attached  to  one  of  the  Sun-4s  is  a  VTE 
display  system  with  256  MB  of  RAM.  This  display 
system  is  capable  of  driving  the  Sun-4  monitors  or 
a  29-inch  Conrac  monitor  at  rates  up  to  60 
frames/second.  In  addition  to  displaying  high- 
resolution  real-time  sequences,  the  ATRP  facilities 
include  a  Metheus  frame  buffer  which  drives  a  Sony 
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2K  X  2K  monitor.  For  hard  copy  output,  the  lab 
uses  a  Kodak  XL7700  thermal  imaging  printer 
which  can  produce  2K  x  2K  color  or  black  and 
white  images  on  11  inch  x  11  inch  photographic 
paper. 

Other  peripherals  include  an  Exabyte  8  mm  tape 
drive,  16-bit  digital  audio  interface  with  two  chan¬ 
nels  and  sampling  rates  up  to  48  kHz  per  channel, 
and  "audio  workstation"  with  power  amplifier, 
speakers,  CD  player,  tape  deck,  and  other  relevant 
equipment.  Additionally,  the  lab  has  a  650  MB 
optical  disk  drive,  a  CD-ROM  drive,  and  two  laser 
printers.  For  preparing  presentations,  the  ATRP 
facilities  also  include  a  Macintosh  SE30  microcom¬ 
puter,  Mac  llx,  and  Apple  LaserWriter. 

We  are  considering  installation  of  a  fast  network 
(FDDI)  to  augment  the  current  10  Mbps  Ethernet. 
The  new  network  would  enable  much  faster  data 
transfer  to  display  devices,  and  it  would  more  easily 
support  large  NFS  transfers. 


2.2.2  Video  Representations  for 
Low-bit-rate  Applications 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

John  G.  Apostolopoulos 

Video  is  becoming  an  important  aspect  in  many  of 
today's  applications  and  is  expected  to  gain  even 
greater  importance  in  the  near  future.  The  large 
raw  data  rate  of  a  video  signal,  together  with  the 
limited  available  transmission  capacity  in  many 
applications,  necessitates  compression  of  the  video. 
A  number  of  image  and  video  compression  algo¬ 
rithms  have  been  developed  for  high-definition  tele¬ 
vision,  video-conferencing,  and  video-phone 
applications.  These  algorithms  create  an  efficient 
representation  of  the  video,  such  as  a  motion  field 
and  error  image,  which  is  subsequently  quantized 
and  coded  for  transmission.  Each  of  these  algo¬ 
rithms  has  been  specifically  tailored  for  its  particular 
application  and  operating  parameters. 

Personal  communication  devices  (PCDs)  and  other 
future  applications  may  have  different  sets  of  con¬ 
straints.  With  possible  power  and  bandwidth  limita¬ 
tions,  they  could  be  required  to  encode  the  video  at 
much  lower  bit  rates.  The  bit  rate  could  be  variable 
for  different  applications  or  could  depend  on  the 
particular  transmission  scenario  (conditions)  existing 
at  the  time.  For  example,  PCDs  or  other  portable 
devices  may  be  able  to  adjust  their  transmission  to 
optimally  utilize  the  available  channel  capacity.  The 


ability  to  efficiently  operate  at  very  low  bit  rates  as 
well  as  at  higher  bit  rates  is  therefore  very  impor¬ 
tant. 

A  representation  of  the  video  which  provides  a 
natural  coupling  among  different  video  acquisi¬ 
tion/display  resolutions  (temporal  and  spatial)  could 
be  very  beneficial.  For  example,  in  point-to-point 
communications,  knowledge  of  the  characteristics  of 
the  acquisition  and  display  devices  and  channel 
capacity  could  be  exploited. 

The  attribute  which  is  most  likely  of  greatest  impor¬ 
tance  for  any  video  compression  algorithm  is  a 
high-quality,  intelligible  reconstruction  of  the  impor¬ 
tant  information  existing  within  the  video.  The 
ability  to  adapt  the  representation  to  exploit  the 
instantaneous  content  of  the  video  signal  could 
improve  the  rendering  of  important  information  in 
each  scene. 

These  issues  are  much  more  crucial  for  applica¬ 
tions  at  low-bit-rates  than  those  at  higher  bit  rates. 
This  research  focuses  on  creating  a  scalable  and 
adaptive/dynamic  representation  of  the  video  signal, 
thereby  enabling  its  very  efficient  and  flexible  com¬ 
pression. 


2.2.3  Design  and  implementation  of  a 
Digital  Audio  Compression  System 

Project  Staff 

Shiufun  Cheung,  Kyle  K.  iwai,  Peter  A.  Monta 

Currently  popular  digital  sound  systems  on  the 
commercial  market,  such  as  the  compact  disc  (CD) 
system  and  the  digital  audio  tape  (DAT)  system, 
typically  deliver  uncompressed  audio  which  is  line¬ 
arly  quantized  to  16  bits  and  sampled  at  a  rate  of 
either  44.1  kHz  for  CDs  or,  more  universally,  48 
kHz.  These  systems  result  in  a  raw  data  rate  of 
over  1 .4  Mbits  per  second  per  stereo  pair.  This  is 
too  high  to  be  practical  for  applications  such  as 
high-definition  television  (HDTV)  transmission  and 
digital  audio  broadcast  (DAB),  both  of  which  have 
channel  bandwidth  constraints.  Recent  develop¬ 
ments  in  audio  coding  technology  show  that 
reduced  data  rate  a  sound  quality  comparable  to 
CD  digital  audio  could  be  achieved.  In  this  project, 
we  have  designed  and  implemented  an  audio  com¬ 
pression  system  which  delivers  perceptually  trans¬ 
parent  audio  at  approximately  1 25  kbits  per  second 
per  monophonic  channel.  This  represents  a  reduc¬ 
tion  factor  of  close  to  6:1  from  the  uncompressed 
systems. 

This  compression  scheme  performs  adaptive  trans¬ 
form  coding  by  passing  the  audio  signal  sampled  at 


264  RLE  Progress  Report  Number  135 


Chapter  2.  Advanced  Television  and  Signal  Processing  Program 


48  kHz  through  a  critically-sampled  single-sideband 
filterbank,  a  technique  also  known  in  the  literature 
as  time  domain  aliasing  cancellation.  The  dynamic 
bit  allocation  is  performed  on  each  transformed 
frame  of  samples.  This  allows  the  coder  to  adapt 
to  varying  characteristics  in  different  sounds.  To 
maintain  perceptual  transparency,  many  aspects  of 
the  algorithm  rely  on  knowledge  of  the  human 
auditory  system.  Key  features  include  critical  band 
analysis  and  exploitation  of  the  perceptual  interband 
masking  model  for  quantization  noise  shaping. 

The  channel  compatible  digicipher  system,  a  real¬ 
time  implementation  of  four  full  channels  of  the 
audio  coder,  was  completed  in  July  1992.  As  the 
audio  subsystem  of  one  of  the  system  proposals, 
we  entered  this  system  into  the  competition  for  the 
United  States  HDTV  standard.  The  real-time 
system,  which  employs  Motorola  96002  digital 
signal  processors,  features  color  displays  for  moni¬ 
toring  purposes,  direct  digital  interface  to  standard 
AES/EBU  audio  input  and  output,  and  optional 
Analog-to-Digital  and  Digital-to-Analog  conversion. 
In  October,  the  system  successfully  completed  all 
the  required  subjective  and  objective  tests  estab¬ 
lished  by  the  Advanced  Television  Testing  Center. 

Our  efforts  to  improve  the  audio  compression 
scheme  are  continuing.  Under  investigation  are  the 
possibility  of  using  adaptive  frame  lengths  for  trans¬ 
form  coding  and  the  merits  of  using  different  signal 
representations  such  as  the  wavelet  transform. 


2.2.4  Design  of  an  High-Definition 
Television  Display  System 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Eddie  F.  Lee 

Several  years  ago,  a  video  filter  and  display  unit 
was  built  by  graduate  students  to  aid  in  the 
development  of  an  HDTV  system.  This  unit  could 
read  large  amounts  of  digital  video  data  from 
memory,  filter  the  data,  and  display  it  at  a  high  rate 
on  a  large,  high-resolution  monitor.  Unfortunately, 
this  display  system  is  very  complex  and  highly 
unreliable,  and  there  is  very  little  documentation  to 
help  diagnose  any  problems  with  the  system. 


This  project,  which  was  completed  in  May  1992, 
involved  the  design  of  a  simpler  and  more  reliable 
display  system.  Since  little  documentation  was 
available,  work  was  done  to  analyze  the  older 
system  to  determine  its  input  specifications. 


2.2.5  Transform  Coding  for  High-Definition 
Television 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Ibrahim  A.  Hajjahmad 

The  field  of  image  coding  has  many  applications. 
One  area  is  the  reduction  of  channel  bandwidth 
needed  for  image  transmission  systems,  such  as 
HDTV,  video  conferencing,  and  facsimile.  Another 
area  is  reduction  of  storage  requirements.  One 
class  of  image  coders  is  known  as  an  transform 
image  coder.’  In  transform  image  coding,  an  image 
is  transformed  to  another  domain  more  suitable  for 
coding  than  the  spatial  domain.  The  transform 
coefficients  obtained  are  quantized  and  then  coded. 
At  the  receiver,  the  coded  coefficients  are  decoded 
and  then  inverse  transformed  to  obtain  the  recon¬ 
structed  image. 

The  discrete  cosine  transform  (DCT),  a  real  trans¬ 
form  with  two  important  properties  that  make  it  very 
useful  in  image  coding,  has  shown  promising 
results.^  In  the  energy  compaction  property  a  large 
amount  of  energy  is  concentrated  in  a  small  fraction 
of  the  transform  coefficients  (typically  low  frequency 
components).  This  property  allows  us  to  code  a 
small  fraction  of  the  transform  coefficients  while 
sacrificing  little  in  the  way  of  quality  and  intelligibility 
of  the  coded  images.  In  the  correlation  reduction 
property,  spatial  domain  is  a  high  correlation  among 
image  pixel  intensities.  The  DCT  reduces  this  cor¬ 
relation  so  that  redundant  information  does  not 
require  coding. 

Current  research  is  investigating  use  of  the  DCT  for 
bandwidth  compression.  New  adaptive  techniques 
are  also  being  studied  for  quantization  and  bit  allo¬ 
cation  that  can  further  reduce  the  bit  rate  without 
reducing  image  quality  and  intelligibility. 


’  J.S.  Lim,  Two-D  r^ensional  Signal  and  Image  Processing.  Englewood  Cliffs.  New  Jersey:  Prentice  Hall,  1990;  R.J.  Clarke,  Transform 
Coding  of  Images,  London:  Academic  Press,  1985. 

2  N.  Ahmed,  T,  Natarajan,  and  K.R.  Rao,  "Discrete  Cosine  Transform,"  IEEE  Trans.  Comput.  C-23:  90-93  (1974). 
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2.2.6  Video  Source  Coding  for 
High-Definition  Television 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Peter  A.  Monta 

Efficient  source  coding  is  the  enabling  technology 
for  high-definition  television  over  the  relatively 
narrow  channels  envisioned  for  the  new  service 
(e.g.,  terrestrial  broadcast  and  cable).  Coding  rates 
are  on  the  order  of  0.3  bits/sample,  and  high  quality 
is  a  requirement.  This  work  focuses  on  developing 
new  source  coding  techniques  for  video  relating  to 
representation  of  motion-compensated  prediction 
errors,  quantization  and  entropy  coding,  and  other 
system  issues. 

Conventional  coders  represent  video  by  using  block 
transforms  with  small  support  (typically  8x8  pixels). 
Such  independent  blocks  result  in  a  simple  scheme 
for  switching  a  predictor  from  a  motion- 
compensated  block  to  a  purely  spatial  block;  this  is 
necessary  to  prevent  the  coder  from  wasting 
capacity  in  some  situations. 

Subband  coders  of  the  multiresolution  or  wavelet 
type,  with  their  more  desirable  localization  proper¬ 
ties,  lack  "blocking"  artifacts  and  match  better  to 
motion-compensated  prediction  errors.  Since  the 
blocks  overlap,  this  complicates  this  process  of 
switching  predictors.  A  novel  predictive  coding 
scheme  is  proposed  in  which  subband  coders  can 
combine  the  benefits  of  good  representation  and 
flexible  adaptive  prediction. 

Source-adaptive  coding  is  a  way  for  HDTV  systems 
to  support  a  more  general  imaging  model  than  con¬ 
ventional  television.  With  a  source  coder  that  can 
adapt  to  different  spatial  resolutions,  frame  rates, 
and  coding  rates,  the  system  can  then  make 
tradeoffs  among  the  various  imagery  types  (for 
example,  60  frames/s  video,  24  frames/s  film,  highly 
detailed  still  images,  etc.).  In  general,  this  effort 
makes  HDTV  more  of  an  image  transport  system 
rather  than  a  least-common-denominator  format  to 
which  all  sources  must  either  adhere  or  be  hacked 
to  fit.  These  techniques  are  also  applicable  to 
NTSC  to  some  extent;  one  result  is  an  algorithm  for 
improved  chrominance  separation  for  the  case  of 
"3-2"  NTSC,  that  is,  NTSC  upsampled  from  film. 

Other  work  includes  design  and  implementation  of  a 
high-fidelity  audio  source  coder  operating  at  125 


kb/s  per  monophonic  channel.  The  coder  uses 
results  from  psychoacoustics  to  minimize  perceived 
quantization-noise  loudness.  This  system  forms 
part  of  the  hardware  submitted  by  MIT  and  General 
Instrument  for  the  United  States  HDTV  standards 
process. 


2.2.7  Error  Concealment  for  an  All-Digital 
High-Definition  Television  System 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Aradhana  Narula 

Broadcasting  high-definition  television  (HDTV) 
requires  transmission  of  an  enormous  amount  of 
information  within  a  highly  restricted  bandwidth 
channel.  Adhering  to  channel  constraints  necessi¬ 
tates  use  of  an  efficient  coding  scheme  to  com¬ 
press  data.  However,  compressing  data 
dramatically  increases  the  effect  of  channel  errors. 
In  the  uncompressed  video  representation,  a  single 
channel  error  affects  only  one  pixel  in  the  received 
image.  In  the  compressed  format,  a  channel  error 
affects  a  block  of  pixels  in  the  reconstructed  image, 
perhaps  even  an  entire  frame. 

One  way  to  combat  the  effect  of  channel  errors  is 
to  add  well-structured  redundancy  to  the  data 
through  channel  coding.  Error  correction  schemes 
generally,  however,  require  transmitting  a  significant 
number  of  additional  bits.  For  a  visual  product  like 
HDTV,  it  may  not  be  necessary  to  correct  all  errors. 
Instead,  removing  the  subjective  effects  of  channel 
errors  using  error  concealment  techniques  may  be 
sufficient,  and  these  techniques  require  fewer  addi¬ 
tional  bits  for  implementation.  Error  concealment 
may  also  be  used  in  conjunction  with  error  cor¬ 
rection  coding.  For  example,  it  may  be  used  to 
conceal  errors  which  the  error  correction  codes  can 
detect  but  not  correct. 

Error  concealment  techniques  take  advantage  of 
the  inherent  spatial  and  temporal  redundancy  within 
transmitted  data  to  remove  the  subjective  effects  of 
these  errors  once  their  location  has  been  deter¬ 
mined.  In  this  research,  error  concealment  tech¬ 
niques  were  developed  and  analyzed  to  help 
protect  the  system  from  errors  occurring  in  several 
parameters  transmitted  for  HDTV  images.  Specif¬ 
ically,  error  concealment  in  the  motion  vectors  and 
the  discrete  cosine  transform  (DCT)  coefficients 
was  investigated. 
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2.2.8  Transmission  of  High*Definition 
Television  Signals  in  a  Terrestrial 
Broadcast  Environment 

Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Julien  J.  Nicolas 

High-definition  television  systems  currently  being 
developed  for  broadcast  applications  require  15-20 
Mbps  to  yield  good  quality  images  for  approximately 
twice  the  horizontal  and  vertical  resolutions  of  the 
current  NTSC  standard.  Efficient  transmission  tech¬ 
niques  must  be  found  to  deliver  this  signal  to  a 
maximum  number  of  receivers  while  respecting  limi¬ 
tations  stipulated  by  the  FCC  for  over-the-air  trans¬ 
mission.  This  research  focuses  on  the  principles 
that  should  guide  the  design  of  such  transmission 
systems. 

The  major  constraints  related  to  the  transmission  of 
broadcast  HDTV  include  (1)  bandwidth  limitation  (6 
MHz,  identical  to  NTSC);  (2)  requirement  for  simul¬ 
taneous  transmission  of  both  NTSC  and  HDTV 
signals  on  two  different  channels  (Simulcast 
approach);  and  (3)  tight  control  of  the  interference 
effects  between  NTSC  and  HDTV,  particularly  when 
the  signals  are  sharing  the  same  frequency  bands. 
Other  considerations  include  complexity  and  cost 
issues  of  the  receivers,  degradation  of  the  signal  as 
a  function  of  range,  etc. 

A  number  of  ideas  are  currently  being  studied. 
Most  systems  proposed  to  date  use  some  form  of 
fonward  error-correction  to  combat  channel  noise 
and  interference  from  other  signals.  Overhead  data 
reserved  for  the  error-correction  schemes  repre¬ 
sents  up  to  30  percent  of  the  total  data,  and  it  is 
therefore  worthwhile  trying  to  optimize  these 
schemes.  Our  current  work  is  focusing  on  the  use 
of  combined  modulation/coding  schemes  capable  of 
exploiting  the  specific  features  of  the  broadcast 
channel  and  the  interference  signals.  Other  areas  of 
interest  include  use  of  combined  source/channel 
coding  schemes  for  HDTV  applications  and  multi¬ 
resolution  coded  modulation  schemes. 


2.2.9  Fractal  Image  Compression 
Sponsor 

Advanced  Television  Research  Program 

Project  Staff 

Lon  E.  Sunshine 


Image  compression  using  transform  coding  has 
been  a  wide  area  of  research  over  the  last  few 
years.  The  wavelet  transform,  in  particular,  decom¬ 
poses  a  signal  in  terms  of  basis  functions  which  are 
scaled  versions  of  one  another.  This  allows  us  to 
exploit  the  scale  invariance  or  self-similarity  within 
an  image  to  achieve  effective  compression. 

The  self-similarity  and  pattern  within  images  is  a 
quality  which  may  allow  for  high  compression  ratios 
with  little  quality  degradation.  Iterated  function 
systems  (IPS)  have  been  shown  to  synthesize 
many  self-similar  (fractal)  images  with  very  few 
parameters.  We  are  investigating  the  feasibility  of 
using  IFSs  and  their  variants  to  exploit  the  self¬ 
similarity  in  arbitrary  images  in  order  to  represent 
these  images  reliably  with  few  parameters. 


2.3  Speech  Signal  Processing 


2.3.1  A  Dual  Excitation  Speech  Model 
Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Contract  N00014-89-J-1489 

Project  Staff 

John  C.  Hardwick 

One  class  of  speech  analysis/synthesis  systems 
(vocoders)  which  have  been  extensively  studied 
and  used  in  practice  are  based  on  an  underlying 
model  of  speech.  Even  though  traditional  vocoders 
have  been  quite  successful  in  synthesizing 
intelligible  speech,  they  have  not  successfully  syn¬ 
thesized  high  quality  speech.  The  multi-band  exci¬ 
tation  (MBE)  speech  model,  introduced  by  Griffin, 
improves  the  quality  of  vocoder  speech  through  the 
use  of  a  series  of  frequency  dependent  voiced/un¬ 
voiced  decisions.  The  MBE  speech  model, 
however,  still  results  in  a  loss  of  quality  as  com¬ 
pared  to  the  original  speech.  This  degradation  is 
caused  in  part  by  the  voiced/unvoiced  decision 
process. 

A  large  number  of  frequency  regions  contain  a  sub¬ 
stantial  amount  of  both  voiced  and  unvoiced 
energy.  If  a  region  of  this  type  is  declared  voiced, 
then  a  tonal  or  hollow  qualify  is  added  to  the  syn¬ 
thesized  speech.  Similarly,  if  the  region  is  declared 
unvoiced,  then  additional  noise  occurs  in  the  syn¬ 
thesized  speech.  As  the  signal-to- noise  ratio 
decreases,  classification  of  speech  as  either  voiced 
or  unvoiced  becomes  more  difficult,  and,  conse¬ 
quently,  degradation  is  increased. 
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The  dual  excitation  (DE)  speech  model,  due  to  its 
dual  excitation  and  filter  structure,  has  been  pro¬ 
posed  in  response  to  these  problems.  The  DE 
speech  model  is  a  generalization  of  most  previous 
speech  models,  and,  with  proper  selection  of  model 
parameters,  it  reduces  to  either  the  MBE  speech 
model  or  to  a  variety  of  more  traditional  speech 
models. 

Current  research  is  examining  the  use  of  this  model 
for  speech  enhancement,  time  scale  modification, 
and  bandwidth  compression.  Additional  areas  of 
study  include  further  refinements  to  the  model  and 
improvements  in  the  estimation  algorithms. 


2.3.2  Speech  Enhancement  Using  the  Dual 
Excitation  Model 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Contract  N00014-89-J-1489 

Project  Staff 

Chang  Dong  Yoo 

Degradation  caused  by  additive  wideband  acoustic 
noise  is  common  in  many  communication  systems 
where  the  disturbance  varies  from  low-level  office 
noise  in  a  normal  phone  conversation  to  high- 
volume  engine  noise  in  a  helicopter  or  an  airplane. 
In  general,  addition  of  noise  reduces  intelligibility 
and  introduces  listener  fatigue.  Consequently,  it  is 
desirable  to  develop  an  automated  speech 
enhancement  procedure  for  removing  this  type  of 
noise  from  the  speech  signal. 

Many  different  types  of  speech  enhancement 
systems  have  been  proposed  and  tested.  The  per¬ 
formance  of  these  systems  depends  upon  the  type 
of  noise  they  are  designed  to  remove  and  the  infor¬ 
mation  which  they  require  about  the  noise.  The 
focus  of  our  work  has  been  on  the  removal  of  wide¬ 
band  noise  when  only  a  single  signal  consisting  of 
the  sum  of  the  speech  and  noise  is  available  for 
processing. 

Due  to  the  complexity  of  the  speech  signal  and  the 
limitations  inherent  in  many  previous  speech 


models,  model-based  speech  analysis/synthesis 
systems  are  rarely  used  for  speech  enhancement. 
Typically,  model-based  speech  enhancement 
systems  introduce  artifacts  into  the  speech  and  the 
quality  degrades  as  the  signal-to-noise  ratio 
decreases.  As  a  consequence,  most  speech 
enhancement  systems  to  date  have  attempted  to 
process  the  speech  waveform  directly  without 
relying  on  an  underlying  speech  model. 

One  common  speech  enhancement  method  is 
spectral  subtraction.  The  basic  principle  behind  this 
method  is  to  attenuate  frequency  components 
which  are  likely  to  have  a  low  speech-to-noise  ratio 
while  leaving  frequency  components  which  are 
likely  to  have  a  high  speech-to-noise  ratio  relatively 
unchanged.  Spectral  subtraction  is  generally  con¬ 
sidered  to  be  effective  at  reducing  the  apparent 
noise  power  in  degraded  speech.  However,  noise 
reduction  is  achieved  at  the  price  of  speech  intelli¬ 
gibility.  Moderate  amounts  of  noise  reduction  can 
be  achieved  without  significant  loss  of  intelligibility, 
but  a  large  amount  of  noise  reduction  can  seriously 
degrade  intelligibility.  The  attenuation  characteristics 
of  spectral  subtraction  typically  lead  to  a  de¬ 
emphasis  of  unvoiced  speech  and  high  frequency 
formants.  This  property  is  probably  one  of  the  prin¬ 
cipal  reasons  for  loss  of  intelligibility.  Other  dis¬ 
tortions  introduced  by  spectral  subtraction  include 
tonal  noise. 

In  this  work,  we  introduce  a  new  speech  enhance¬ 
ment  system  based  on  the  DE  speech  model  which 
overcomes  some  of  these  problems.  The  DE 
system  is  used  to  separate  speech  into  voiced  and 
unvoiced  components.  Since  the  acoustic  back¬ 
ground  noise  has  characteristics  which  are  similar 
to  unvoiced  speech,  the  unvoiced  component  will 
be  principally  composed  of  the  unvoiced  speech 
plus  the  background  noise.  The  voiced  component 
will  be  principally  composed  of  the  harmonic  com¬ 
ponents  of  the  speech  signal.  As  a  consequence, 
speech  enhancement  can  be  achieved  through  sub¬ 
sequent  processing  of  the  unvoiced  component  to 
reduce  the  apparent  noise  level.  New  processing 
methods  have  been  derived  which  take  advantage 
of  the  unique  properties  of  the  individual  compo¬ 
nents  to  reduce  distortion  introduced  into  the  pro¬ 
cessed  speech. 
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3.1  Project  Description 

The  purpose  of  this  project  is  to  develop  a  high- 
definition  television  system  (HDTV)  for  terrestrial 
broadcasting  that  has  high-spectrum  efficiency, 
effective  suppression  of  analog  channel  impair¬ 
ments  (ghosts,  noise,  interference,  and  frequency 
distortion),  a  feasible  transition  scenario,  reasonable 
cost,  good  interoperability  with  other  media  and 
applications,  and  permits  nondistruptive  improve¬ 
ment  over  time.  To  this  end,  a  two-stage  spread- 
spectrum  system  has  been  simulated  in  accordance 
with  the  plans  discussed  in  RLE  Progress  Report 
Number  134. 

In  spread-spectrum  television,  the  video  signal  is 
divided  into  a  large  number  of  narrow-band  compo¬ 
nents  so  that  the  symbol  length  is  larger  than  the 
temporal  spread  of  the  echoes.  All  echoes  arrive 
within  one  symbol  time,  eliminating  intersymbol 
interference.  Each  narrow-band  component  is 
spread  to  the  full  spectrum  width  (6  MHz)  by  multi¬ 
plying  it  with  a  different  pseudorandom  sequence. 
All  the  products  are  added  and  transmitted.  At  the 
receiver,  the  components  are  separated  by  multipli¬ 
cation  by  the  same  sequences,  whose  orthogonality 
ensures  elimination  of  crosstalk. 

Unfortunately,  echoes  reduce  the  orthogonality  of 
the  random  sequences.  While  scrambling  is  helpful 
because  it  causes  interference  to  become  random 
noise,  the  noise  level  is  still  excessive.  It  is  not  the¬ 
oretically  possible  to  design  sequences  that  have 
adequate  cross-correlation  performance  when  time- 
shifted. 

We  have  found  a  possible  solution  to  this  problem 
by  using  orthogonal  frequency-division  multiplex 
(OFDM).  This  channel-coding  method,  invented  at 
Bell  Laboratories  in  the  1960s,  also  divides  the 
signal  into  narrow-band  components.  Each  signal 
is  modulated  on  a  separate  carrier.  The  signals  in 


adjacent  channels  are  orthogonal,  permitting  their 
complete  separation  at  the  receiver.  A  guard 
interval  is  inserted  between  symbols  to  ensure  that 
all  ghosts  arrive  entirely  within  one  integration 
period.  A  very  simple  implementation  is  provided 
by  the  use  of  the  Fourier  transform.  This  generates 
a  baseband  version  of  the  transmitted  signal  in  a 
single  operation. 

OFDM,  which  has  been  used  in  Europe  for  digital 
audio  broadcasting,  is  being  investigated  now  for 
digital  video  broadcasting  by  a  number  of  laborato¬ 
ries.  At  a  meeting  held  at  MIT  in  October  1992,  all 
European  and  Canadian  OFDM  projects  were 
represented.  A  particularly  attractive  version  of 
OFDM,  called  single-frequency  networks  (SFN), 
was  discussed  at  some  length.  In  this  method,  the 
receiving  area  of  a  television  station  is  filled  with  a 
cellular  network  of  low-power  transmitters,  operating 
on  the  same  frequency.  The  multiple  signals  seen 
at  each  receiver,  particularly  if  omnidirectional 
receiving  antennas  are  used,  constitute  "active 
echoes,"  which  are  suppressed  by  OFDM.  This 
arrangement  results  in  a  much  more  unifc'm  signal 
level  at  the  receivers,  obviating  the  need  foi  a  soft 
threshold.  It  also  permits  much  higher  spectrum 
efficiency,  since  stations  serving  contiguous  areas 
can  use  the  same  frequency.  The  "no-man’s  land" 
between  stations,  in  which  directional  antennas 
would  have  to  be  used,  are  about  the  same  width 
as  the  cells.  Thus,  to  provide  about  20  inde¬ 
pendent  services  in  each  locality,  not  much  more 
than  20  (instead  of  the  68  which  are  needed  at 
present)  channels  must  be  allocated  for  television 
service. 

We  are  now  actively  pursuing  the  goal  of  config¬ 
uring  our  system  to  support  the  SFN  concept. 
There  is  some  uncertainty  about  the  precise 
manner  in  which  the  echoes  combine  at  the 
receiver  and  we  are  considering  a  field  test  to  eluci¬ 
date  the  phenomenon. 
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1 .1  Introduction 

The  primary  objective  of  this  cooperative  work  with 
the  Houston  Advanced  Research  Center  is  to 
develop  a  novel  method  for  automated,  low-cost, 
high-throughput  DNA  sequence  analysis.  The 
overall  goal  is  to  demonstrate  laboratory  prototypes 
that  provide  a  substantial  increase  in  speed  over 
the  conventional  DNA  sequencing  methods  now 
used  in  the  biomedical,  pharmaceutical,  and  agricul- 
hiral  industries. 

The  basic  approach  being  taken  is  depicted  in 
figure  1 .  In  a  hypothetical  DNA  sequencing  test,  a 
solution  of  single-stranded  "target"  DNA  strands  of 
identical  but  unknown  sequence  is  washed  onto  a 
specialized  microelectronic  chip  called  a  geno- 
senscr.  The  genosensor  surface  contains  a  Urge 
array  of  test  sites,  each  site  containing  short  pieces 
of  single-stranded  DNA  known  as  "probes."  These 
probes  are  chemically  attached  to  the  site.  All 
probes  in  a  given  site  are  of  like  sequence,  and  the 
sequence  for  each  site  is  unique  on  the  chip.  The 
target  DNA  strands  will  bond,  or  hybridize,  strongly 
to  those  sites  containing  probes  having  an  exact 
complementary  sequence  match  but  will  hybridize 
much  less  strongly  to  other  sites.  The  sites  con¬ 
taining  hybridized  DNA  are  identified  via  electronic 
sensing  on  the  chip,  and  this  information  is  used  by 
off-chip  instrumentation  to  reconstruct  the  sequence 
of  the  target  strands.  MIT's  role  in  this  effort  is  pri¬ 
marily  the  design  and  fabrication  of  the  genosensor 
chips. 


1.2  Development  of  Genosensor  Arrays 
for  DNA  Decoding 

Sponsor 

Houston  Advanced  Research  Center 
Contract  HRC-HG00665-L'1 

Project  Staff 

Dr.  Daniel  J.  Ehrlich,  Dr.  Dennis  D.  Rathman,  Dr. 
Mark  A.  Hollis,  Dr.  John  Meingailis 

The  simplest  electrical  measurement  that  can  be 
made  at  a  test  r'te  to  detect  hybridization  is  pro¬ 
bably  a  measurement  of  the  change  in  local 
permittivity  due  to  the  addition  of  long  target  strands 
to  the  site.  The  complex  permittivity  p'-je"  of  an 
aqueous  solution  containing  DNA  exhibits  a  disper¬ 
sion  around  a  relaxation  frequency  which  is  a  func¬ 
tion  of  the  size  (diameter  in  solution)  of  the  DNA 
molecule.  A  measurement  of  the  capacitance 
and/or  conductance  between  two  electrodes  in  the 
solution  over  a  range  of  frequency  can  therefore  dif¬ 
ferentiate  between  a  site  that  contains  only  short 
probe  strands  and  one  that  contains  long  target 
strands  hybridized  to  the  probe  strands. 

Figure  2  shows  a  simple  conceptual  view  of  a 
permittivity  genosensor  wafer  that  consists  of  rows 
and  columns  of  overlapping  metal  lines  having  a 
test  site  (well)  at  each  crossover.  The  two  metal 
interconnect  lines  at  each  well  have  one  or  more 
electrodes  that  protrude  into  the  well.  A  measure¬ 
ment  is  made  by  applying  a  signal  to  the  two  lines 
for  a  desired  well  a.  id  measuring,  for  example,  the 
dissipation  factor  t  'It'  of  the  well  capacitance  using 
an  RLC  meter.  In  this  manner,  all  the  wells  of  the 
array  can  be  rapidly  interrogated  in  sequence. 

The  ideal  electroc'^  structure  in  a  test  well  consists 
of  two  parallel  plates  spaced  so  that  the  entire 
volume  between  them  is  filled  by  the  hybridized 
DNA  globules  in  aqueous  solution.  For  the  sizes  of 
target  DNA  envisioned,  this  spacing  ranges  from 
approximately  200  to  a  few  thousand  angstroms.  A 
practical,  easily  fabricated  structure  that  approxi¬ 
mates  this  ideal  is  an  interdigitated  design  shown  in 
figure  3.  Fabricated  by  a  combination  of  wet  and 
dry  etching  with  metal  liftoff,  this  design  can 
achieve  the  required  spacings  between  the  upper 
and  lower  electrodes  at  their  edges.  Results  of 
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Figure  2.  Conceptual  view  of  a  microelectronic  geno- 
sensor. 
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Figure  3.  Electrode  design  for  a  permittivity  genosensor. 
The  basic  unit  cell  shown  is  repeated  many  times  across 
a  test  well  to  form  an  ihterdigitated  test  structure  with  the 
top  Au  electrodes  connected  to  one  access  line  and  the 
bottom  Au  electrodes  to  the  other. 


1.3  Microdetection  Technology  for 
Automated  DNA  Sequencing 

Sponsor 

Houston  Advanced  Research  Center 
Contract  HRC-HG00776-01 

Project  Staff 

Dr.  Daniel  J.  Ehrlich,  Dr.  Dennis  D.  Rathman,  Dr. 
Mark  A.  Hollis,  Dr.  John  Meingailis 

The  primary  emphasis  of  this  effort  is  to  comple¬ 
ment  the  program  described  above.  We  are  per¬ 
forming  extensive  computer  modeling  combined 
with  detailed  electrical  measurements  of  parasitics 
to  develop  a  high-sensitivity  electrode  geometry  for 
a  permittivity  genosensor.  The  computer  modeling 
has  begun,  and  a  new  set  of  photomasks  has  been 
designed  and  fabricated  in  which  the  dimensions  of 
the  electrodes  are  varied  to  enable  experimental 
determination  of  the  optimum  electrode  geometry. 
These  photomasks  are  now  being  used  to  fabricate 
genosensor  arrays  for  the  electrical  measurements. 
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1.1  Introduction 

The  overall  objective  of  our  research  in  speech 
communication  is  to  gain  an  understanding  of  the 
processes  whereby  (1)  a  speaker  transforms  a  dis¬ 
crete  linguistic  representation  of  an  utterance  into 
an  acoustic  signal,  and  (2)  a  listener  decodes  the 
acoustic  signal  to  retrieve  the  linguistic  represen¬ 
tation.  The  research  includes  development  of 
models  for  speech  production,  speech  perception, 
and  lexical  access,  as  well  as  studies  of  impaired 
speech  communication. 

Sponsors 

C.J.  Lebel  Fellowship 
Dennis  Klatt  Memorial  Fund 
Digital  Equipment  Corporation 
National  Institutes  of  Health 
Grant  R01-DC00075 
Grant  R03-DC01721 
Grant  R01-DC01291 
Grant  R01-DC00266'5 
Grant  P01-DC00361-06A1'5 
Contract  R01-DC00776'® 

National  Science  Foundation 
Grant  IRI  89-05249'^ 

Grant  IRI  89-10561 
Grant  INT  90-24713'® 


1.2  Speech  Production  Planning 


1.2.1  Segmental  Planning 

Speech  errors  like  "lightly  britt"  (for  "brightly  lit") 
suggest  that  sublexical  elements  undergo  serial 
ordering  in  preparation  for  production,  and  contex¬ 
tual  modifications  that  accommodate  such  errors 
(like  "an  ignificant  amount"  (for  "a  significant 
amount")  suggest  that  further  phonological  pro¬ 
cessing  follows  this  serial  ordering.  Our  proposed 
model  for  sublexical  serial  ordering  postulates  a  set 
of  abstract  phonemic  elements  provided  by  the  can¬ 
didate  lexical  items  for  the  utterance,  a  prosodically- 
structured  phrasal  frame  of  slots  or  locations  to 
which  the  individual  segment  specifications  must  be 
associated,  and  a  mechanism  for  integrating  the 
segments  with  their  slots.  This  model  includes  a 


word-based  search  process  through  the  set  of  can¬ 
didate  elements:  other  models  suggest  a  syllable- 
based  search.  Distinguishing  between  these  two 
approaches  is  difficult  on  the  basis  of  existing 
corpora  of  sublexical  errors.  The  majority  of 
frequently-used  words  of  English  are  monosyllables 
so  that  position  similarity  constraints  on  interacting 
pairs  of  segments  (initial  with  initial,  final  with  final, 
etc.)  are  compatible  with  both  word  and  syllable 
structure.  In  a  series  of  error  elicitation  exper¬ 
iments,  however,  we  have  shown  that  errors  are 
more  likely  between  pairs  of  word-initial  consonants 
than  between  pairs  that  are  not  both  word-initial, 
even  when  both  members  of  the  pair  are  syllable 
initial.  Thus,  word  structure  must  play  a  role  in  the 
serial  ordering  process  for  sublexical  elements: 
syllable-based  structure  alone  cannot  account  for 
the  distribution  of  segmental  errors. 

Additional  experiments  in  progress  explore  the  pos¬ 
sibility  that  word-final  segments,  which  seldom  par¬ 
ticipate  in  sublexical  errors,  are  protected  against 
such  errors  by  the  phrasal  representations  of  syn¬ 
tactically  and  prosodically  well-formed  utterances 
but  not  by  the  planning  representations  prepared 
for  word  lists.  We  are  also  developing  methods  to 
test  the  hypothesis  that  more  sublexical  errors 
occur  when  phrasal  prosody  is  irregular  and  less 
rhythmic,  and  fewer  errors  when  phrasal  prosody  is 
highly  regular  and  rhythmic.  Such  a  finding,  sug¬ 
gested  by  our  preliminary  results,  would  indicate 
that  the  two  mechanisms  of  segmental  serial 
ordering  and  computation  of  phrasal  prosody  invoke 
some  of  the  same  processing  capacity,  as  our 
model  postulates. 


1.2.2  Prosodic  Planning 

Although  the  major  perceptual  prominence  of  a 
word  of  American  English  falls  on  its  main  stress 
syllable  when  the  word  is  produced  by  itself  (as  in 
"indeSCRIbably"),  in  running  speech  such 
polysyllabic  words  with  late  stress  often  have  a 
stronger  prominence  on  an  earlier  syllable  (as  in 
"That  was  INdescribably  TASty").  This  phenom¬ 
enon,  formerly  described  as  Stress  Shift,  has  been 
described  as  the  shift  of  stress  leftward  from  the 
main-stress  syllable  to  relieve  a  Stress  Clash, 
created  by  the  juxtaposition  of  the  main-stress  sylla- 
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bles  of  adjacent  words  within  the  same  phrase.  In 
acoustic  and  perceptual  analyses  of  pairs  of  sen¬ 
tences  read  aloud  in  the  laboratory,  as  well  as  of 
more  than  750  utterances  in  a  corpus  of  FM  radio 
news  style  speech  recorded  at  broadcast,  we  have 
shown  that  many  instances  of  apparent  stress  shift 
are  better  described  in  terms  of  the  placement  of 
phrase-level  tonal  prominences  called  pitch 
accents.  That  is,  apparent  stress  shift  reflects  the 
unmasking  of  an  early  pitch  accent  in  the  target 
word,  an  unmasking  which  is  brought  about  by  the 
disappearance  of  the  pitch  accent  from  the  main 
stress  syllable.  We  have  also  found  evidence  to 
support  the  claim  that  the  placement  of  a  pitch 
accent  on  the  early  syllable  of  the  word  is  affected 
by  two  separate  prosodic  factors:  a  tendency  to 
place  a  pitch  accent  as  early  as  possible  in  a  new 
prosodic  phrase  (onset  marking)  and  a  tendency  to 
avoid  placing  pitch  accents  too  dose  together  (pitch 
accent  clash).  The  possibility  that  the  more  tradi¬ 
tional  form  of  rhythmic  stress  shift  also  occurs  in 
non-pitch-accented  stretches  of  speech  is  under 
investigation.  This  work  has  been  carried  out  in 
conjunction  with  Dr.  Mari  Ostendorf  at  Boston  Uni¬ 
versity  and  Dr.  Patti  Price  at  SRI  International. 
Published  in  the  proceedings  of  several  meetings 
and  workshops,  this  work  has  been  submitted  for 
journal  publication. 

We  are  also  exploring,  in  conjunction  with  a  number 
of  other  laboratories,  the  feasibility  of  a  new  tran¬ 
scription  system  for  prosodic  structure — the  TOBI 
system  (for  TOnes  and  Break  Indices).  This 
system  represents  the  effort  of  a  number  of  investi¬ 
gators  to  develop  a  method  of  transcribing  the  core 
prosodic  facts  about  an  utterance  including  the  con¬ 
stituent  boundaries,  boundary  tones,  and  promi¬ 
nences.  We  developed  this  system  for  use  by  a 
large  number  of  laboratories  involved  in  prosody 
research  to  more  quickly  assemble  a  large  data¬ 
base  of  prosodically  labeled  speech.  We  are  cur¬ 
rently  involved  in  testing  whether  inexperienced 
learners  can  learn  to  use  TOBI  in  a  reasonable 
amount  of  time.  This  collaborative  work  is 
attempting  to  develop  teaching  materials  and  a 
manual  and  assess  the  reliability  of  the  tran¬ 
scriptions  across  different  listeners  and  speech 
materials. 


1.3  Studies  of  Speech  Production 


1.3.1  Trading  Relations  between 
Tongue-body  Raising  and  Lip  Rounding  in 
Production  of  the  Vowel  /u/:  A  Pilot  Motor 
Equivalence  Study 

Articulatory  and  acoustic  data  were  used  to  explore 
the  following  hypothesis  for  the  vowel  /u/:  the  goal 
of  articulatory  movements  is  a  relatively  invariant 
acoustic  target  which  may  be  achieved  with  varying 
and  reciprocal  contributions  of  different  articulators. 
Previous  articulatory  studies  of  similar  hypotheses, 
expressed  entirely  in  articulatory  terms,  have  been 
confounded  by  interdependencies  of  the  variables 
being  studied  (e.g.,  lip  and  mandible  displace¬ 
ments).  One  case  in  which  this  problem  may  be 
minimized  is  that  of  lip  rounding  and  tongue-body 
raising  (formation  of  a  velo-palatal  constriction)  for 
the  vowel  /u/.  Lip  rounding  and  tongue-body  raising 
should  have  similar  acoustic  effects  for  /u/,  mainly 
to  lower  F2.  In  multiple  repetitions,  reciprocal  con¬ 
tributions  of  lip  rounding  and  tongue-body  raising 
could  help  constrain  F2  variability  for  /u/;  thus,  this 
experiment  looked  for  complementary  covariation 
(negative  correlations)  in  measures  of  these  two 
parameters.  An  Electro-Magnetic  Midsagittal 
Articulometer  (EMMA)  was  used  to  track  move¬ 
ments  of  midsagittal  points  on  the  tongue  body, 
upper  and  lower  lips,  and  mandible  for  large 
numbers  of  repetitions  of  utterances  containing  /u/. 
Three  of  four  subjects  showed  weak  negative  corre¬ 
lations,  tentatively  supporting  the  hypothesis:  a 
fourth  showed  the  opposite  pattern — positive  corre¬ 
lations  (coordination)  of  lip  rounding  and  tongue 
raising.  The  results  are  discussed  in  a  recent  publi¬ 
cation  including  ideas  about  motor  equivalence,  the 
nature  of  speech  motor  programming,  and  potential 
improvements  to  the  paradigm. 


1.3.2  Tongue  Surface  Deformation  during 
Obstruent  Stop  Consonants 

When  an  obstruent  stop  consonant  is  produced, 
there  is  an  increase  in  intraoral  pressure,  and  this 
increased  pressure  can  result  in  outward  movement 
of  the  vocal-tract  walls.  Following  release  of  the 
consonant,  the  intraoral  pressure  decreases,  and 
the  walls  are  expected  to  move  inward  with  a  time 
constant  that  depends  on  the  physical  properties  of 
the  surfaces.  We  used  an  electromagnetic 
midsagittal  articulometer  (EMMA)  to  measure  the 
vertical  position  of  the  tongue  dorsum  during  a 
number  of  repetitions  of  five  utterances:  /ama/, 
/apa/,  /aba/,  /ampa/  and  /amba/.  Results  seem  to 
confirm  the  physical  description  given  above:  the 
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average  lowering  in  tongue  dorsum  position  fiom 
implosion  to  release  of  the  consonant  was  0.4  mm 
for  /m/,  2.7  mm  for  /b/  and  1.3  mm  for  /p/.  The 
smaller  movement  for  /p/  than  for  /b/  may  be  due  to 
active  stiffening  of  the  tongue  during  /p/,  partly 
counteracting  the  effect  due  to  intraoral  pressure. 
These  displacements  and  their  rates  of  movement 
are  consistent  with  published  data  on  average 
subglottal  pressure  and  compliance  and  resistance 
of  vocal  tract  surfaces.  These  data  have  been 
used  to  refine  current  estimates  of  physical  para¬ 
meters  of  articulator  surfaces  during  speech  pro¬ 
duction. 


1.3.3  Advances  in  Techniques  for  Studying 
Temporai  Aspects  of  Inter-articulator 
Coordination 

We  are  expanding  previous  analyses  of  anticipatory 
labial  coarticulation  (Perkell  and  Matthies,  1992)  to 
study  the  more  general  issue  of  inter-articulatory 
coordination.  For  this  purpose,  we  have  acquired 
preliminary  data  on  movements  of  the  lips,  tongue 
and  mandible  from  two  subjects.  In  order  to 
analyze  these  and  future  data,  we  have  made  the 
following  improvements  to  signal  processing  and 
data  extraction  procedures:  (1)  Filter  selection  and 
signal  processing  to  produce  velocity  and  acceler¬ 
ation  signals  that  are  sufficiently  smooth  for  reliable 
algorithmic  event  detection  but  retain  analyzable 
peaks.  This  signal  processing  calculates  velocity 
and  acceleration  magnitudes  as  well  as  one¬ 
dimensional  measures.  (2)  Program  development 
of  a  module  to  facilitate  the  labeling  of  the  acoustic 
signals  and  marking  of  acoustic  events  prior  to  the 
analysis  of  the  physiological  signals.  This  module 
allows  for  the  review  cf  assigned  labels  and  the 
elimination  of  mispronounced  tokens.  It  has  been 
tested  and  used  extensively.  (3)  Development  of 
program  modules  to  accomplish  algorithmic 
detection  of  events  in  one  and  two-dimensional  dis¬ 
placement,  velocity  and  acceleration  signals  from 
multiple  articulators.  (4)  Development  of  a  program 
which  allows  for  ensemble  averaging  of  multiple 
articulator  movement  versus  time  trajectories. 
Standard  deviation  calculations  are  included  as  well 
as  displays  of  multiple  y  versus  x  trajectories. 


1.3.4  Sequence  in  Simultaneity:  Phonetic 
"Enhancements"  in  Evye  Doubly-articulated 
Stops 

This  project  is  being  done  in  collaboration  with  Ian 
Maddieson,  Department  of  Linguistics,  University  of 
California  at  Los  Angeles. 


The  timing  of  labial  and  velar  articulations  in  the 
doubly-articulated  stops  /kp/  and  /gb/  in  Ewe  was 
investigated  with  EMMA  (articulatory)  data  from  two 
speakers.  The  results  confirmed  the  expected 
temporal  asymmetry  of  the  two  gestures,  as  well  as 
the  general  similarity  of  these  to  the  gestures  in 
simple  velar  and  labial  stops.  Backward  movement 
of  the  tongue  in  /kp/  and  /gb/  was  observed,  but  the 
timing  and  spatial  orientation  of  this  movement  as 
well  as  certain  acoustic  details  suggest  that,  for 
these  speakers,  this  is  due  to  rarefaction  in  the 
pharyngeal  cavity  rather  than  to  an  active  attempt 
to  expand  the  front  cavity.  The  maneuvers  involved 
in  production  of  /kp/  and  /gb/  result  in  asymmetrical 
acoustic  transitions  to  and  from  the  consonant, 
additional  duration,  less  aspiration  for  /kp/,  and 
greater  voicing  for  /gb/  in  comparison  with  singly 
articulated  stops.  All  of  these  can  be  considered 
"enhancements"  of  what  would  othenwise  be  a  non¬ 
salient  contrast. 


1.4  Speech  Research  Related  to 
Special  Populations 

1.4.1  Degradation  of  Speech  and  Hearing 
with  Bilateral  Acoustic  Neuromas 

In  this  project  we  are  studying  the  relation  between 
speech  and  hearing  in  people  who  become  deaf 
from  bilateral  acoustic  neuromas  (auditory  nen/e 
tumors)  associated  with  the  genetic  disorder, 
neurofibromatosis  type  II  (NF2).  The  primary  goal 
of  this  study  is  to  increase  our  understanding  of  the 
role  of  hearing  in  the  control  of  adult  speech  pro¬ 
duction.  The  rationale  and  approach  for  this  work 
are  similar  to  those  of  our  ongoing  work  on  the 
speech  production  of  cochlear  implant  patients. 
Speech  acoustic  and  physiological  parameters  will 
be  recorded  and  speech  perception  will  be  tested  in 
a  group  of  (still  hearing)  NF2  patients.  Then  the 
same  parameters  will  be  recorded  at  intervals  for 
the  subset  of  patients  who  suffer  further  hearing 
loss. 

Thus  far,  we  have  developed  complete  protocols  for 
recording  speech  acoustic  and  physiological  para¬ 
meters,  as  well  as  procedures  for  testing  non¬ 
speech  oral-motor  capabilities  (as  control 
measures).  We  have  obtained  baseline  recordings 
from  25  patients.  Two  to  three  baseline  recordings 
have  been  obtained  from  those  facing  imminent 
hearing  loss.  Two  of  these  patients  have  recently 
undergone  acoustic  neuroma  surgery.  One  patient 
has  more  hearing  loss  as  a  result  of  the  surgery. 
The  other  suffered  complete  hearing  loss  but  now 
has  an  auditory  brainstem  implant  which  will 
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provide  some  degree  of  (degraded)  hearing.  Both 
of  these  patients  will  return  soon  for  further  testing. 


1.4.2  Speech  Production  of  Cochlear 
Implant  Patients 

This  project  is  funded  under  a  subcontract  with  the 
Massachusetts  Eye  and  Ear  Infirmary,  with  the  col¬ 
laboration  of  Professor  Harlan  Lane  of  Northeastern 
University. 

This  research  aims  to  characterize  the  speech  pro¬ 
duction  of  postlingually  deafened  adults  before  and 
after  they  receive  cochlear  implants,  in  order  to  help 
evaluate  and  improve  prostheses  and  to  constrain 
models  of  the  role  of  hearing  in  speech  production. 

We  have  been  making  longitudinal  measures  of 
speech  acoustics.  For  vowels  the  measures 
include:  vowel  formants,  fundamental  frequency, 
sound  pressure  level,  duration,  and  an  acoustically- 
based  measure  of  inferred  glottal  aperture.  For 
consonants,  the  measures  are  acoustic  correlates 
of  place  and  manner  cues  in  plosives  and  fricatives. 
We  also  measure  suprasegmental  contours  of  pitch 
and  amplitude  and  several  parameters  of  speech 
breathing;  mean  air  flow,  volume  of  air  per  syllable, 
and  speech  initiation  and  termination  levels. 

Consistent  with  informal  reports  and  limited  exper¬ 
imental  evidence  from  other  investigators,  we  have 
found  large  changes  in  the  suprasegmental  proper¬ 
ties  of  speech  following  processor  activation.  In 
particular,  activation  of  the  processor  is  invariably 
followed  by  reductions  in  sound  pressure  level  and 
increases  in  rate,  commonly  followed  by  changes  in 
mean  airflow  in  the  direction  of  normal,  and  some¬ 
times  followed  by  reductions  in  pitch  in  the  direction 
of  normal.  In  addition,  the  pitch  and  amplitude  con¬ 
tours  for  one  subject,  which  were  peculiarly  vari¬ 
able,  became  more  smoothly  modulated  following 
processor  activation.  At  the  segmental  level,  we 
have  found  changes  in  vowel  formants,  durations 
and  amplitudes,  commonly  in  the  direction  of 
normal.  In  addition,  we  have  found  increasing  differ¬ 
entiation  of  voicing  in  the  plosives  and  of  a  place 
cue  in  the  voiceless  fricatives  following  processor 
activation  in  some  speakers.  An  analysis  of 
covariance  among  parameter  changes  within  and 
across  sessions  has  led  to  a  model  of  interactions 
among  supraglottal  and  glottal  articulations  and 
their  regulation  by  auditory  feedback  from  the 
implant.  We  have  developed  software  for  analyzing 
fricative  sounds  to  examine  the  /  s  /  -  /  ^  /  dis¬ 


tinction  pre-  and  post-implant.  Analyses  of  data 
from  four  subjects  is  in  progress. 


1.4.3  Objective  Assessment  of  Vocal 
Hyperfunction 

This  project  is  funded  under  a  subcontract  to  the 
Massachusetts  Eye  and  Ear  Infirmary  in  collab¬ 
oration  with  Dr.  Robert  E.  Hillman. 

In  the  past  year,  this  project  moved  from  the 
Department  of  Communication  Disorders,  Boston 
University,  to  a  new  Voice  and  Speech  Laboratory 
at  the  Massachusetts  Eye  and  Ear  Infirmary.  Dr. 
Hillman  has  been  appointed  director  of  the  new  lab¬ 
oratory,  where  new  facilities  have  been  installed  for 
continuation  of  this  work.  Data  collection  proce¬ 
dures  have  been  expanded  to  include 
plethysmographic  assessment  of  respiratory  func¬ 
tion.  These  data  will  be  analyzed  with  software 
developed  for  use  in  the  Cochlear  Implant  and 
Acoustic  Neuroma  projects.  Software  has  been 
developed  for  automatically  running  and  scoring  lis¬ 
tener  tests  of  voice  quality  judgments.  The  results 
of  these  tests  will  be  compared  with  aerodynamic 
and  acoustic  measures. 


1.4.4  A  Multidimensional  Mathematical 
Model  of  Vowel  Perception  for  Users  of 
Pulsatile  Cochlear  Implants 

We  have  continued  to  work  on  a  mathematical 
model  to  account  for  vowel  perception  by  users  of 
pulsatile  cochlear  implants.  We  have  extended  the 
model  to  three  dimensions  in  order  to  deal  with 
cochlear  implants  that  encode  one  to  three  formant 
frequencies.  These  cochlear  implants  estimate  the 
frequency  of  up  to  three  formants  and  stimulate  one 
to  three  channels  every  fundamental  period  (one 
channel  per  formant).  Thus,  formant  frequencies 
are  encoded  by  the  position  of  the  electrodes  that 
are  stimulated  at  any  given  moment  in  time. 

The  model  was  assessed  using  vowel  confusion 
matrices  obtained  from  four  subjects  who  were  all 
tested  with  stimulation  schemes  that  employed  one, 
two,  and  three  formants.’®  Vowel  confusion  matrices 
obtained  with  the  one-formant  scheme  were  used  to 
estimate  sensitivity  (the  ability  of  each  subject  to 
identify  which  channel  has  been  stimulated)  and 
were  also  used  to  estimate  each  subject's  average 
response  bias.  The  amount  of  bias  was  variable, 
but  in  all  cases  it  was  consistent  with  the  fact  that 
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for  patients  with  cochlear  implants,  a  given  formant 
frequency  causes  stimulation  of  cochlear  areas  that 
are  more  basal  than  me  areas  stimulated  by  the 
same  formant  the  subject  had  normal 

hearing.  The'^p  estimates  of  sensitivity  and  bias 
were  then  used  to  predict  the  vowel  identification 
performance  of  each  subject  under  each  stimulation 
scheme.  The  model  predicted  successfully  that  the 
two-formant  scheme  would  be  better  than  the  one- 
formant  scheme  and  roughly  equal  to  the  three- 
formant  scheme.  For  three  of  the  subjects,  it  also 
made  good  predictions  of  the  number  of  correct 
responses.  A  fourth  subject  performed  better  than 
expected  on  the  two-formant  strategy,  perhaps 
because  he  employed  inter-formant  distance  as  an 
additional  dimension  in  order  to  identify  vowels. 

We  also  used  the  model  to  assess  the  extent  to 
which  different  subjects  adapted  to  the  more-basal- 
than-normal  stimulation  provided  by  their  cochlear 
implants.  Our  results  show  that  all  subjects 
adapted  to  a  large  extent.  One  subject  seemed  to 
adapt  almost  completely,  while  the  three  others 
showed  different  amounts  of  response  bias. 


1.4.5  Vowel  Nasalization  in 
Hearing-impaired  and  Normai-hearing 
Children 

In  our  studies  of  the  nasalization  of  vowels  in 
speech  of  hearing-impaired  and  normal-hearing 
children,  the  bandwidth  of  the  first  formant  and  the 
prominence  of  the  extra  peak  due  to  the  pole-zero 
pair  have  been  found  to  be  significant  acoustic 
cues.  This  result  is  in  accord  with  the  theory  of  the 
nasal  cavity  coupling.  In  the  speech  of  both 
hearing-impaired  and  normal-hearing  children,  a 
wider  first  formant  bandwidth  and  greater  peak 
prominence  were  "bserved  for  those  vowels  that 
were  judged  to  be  more  nasal.  In  acoustic  analysis 
of  nasalized  vowels  spoken  by  normal-hearing 
adults,  a  prominent  peak  was  apparent  around  950 
Hz.  For  hearing-impaired  children,  the  average 
value  was  940  Hz,  which  is  lower  than  that  of  the 
adult,  contrary  to  expectations.  This  may  be  due  to 
the  sensitivity  of  measuring  the  frequency  of  the 
peak  by  using  the  value  of  its  closest  harmonic, 
which  is  dependent  on  the  fundamental  frequency. 
The  difference  between  the  amplitude  of  the  first 
formant  At,  which  is  inversely  related  to  the  band¬ 
width,  and  the  amplitude  of  the  extra  peak  P1, 
which  is  directly  related  to  the  amount  of  separation 
between  the  nasal  pole  and  zero,  was  used  as  a 
measure  of  the  degree  of  vowel  nasalization.  The 
magnitude  of  the  correlation  coefficient  between 
A1-PI  and  the  average  nasality  judgment  for  the 
speech  of  hearing-impaired  and  normal-hearing 
children  was  found  to  be  0.80,  indicating  that  the 


difference  measure  is  a  very  promising  parameter 
for  quantifying  nasalization. 

A  perceptual  experiment  using  resynthesized  ver¬ 
sions  of  utterances  (of  the  form  bVt)  from  a  normal¬ 
hearing  speaker  was  carried  out  to  further  quantify 
the  relation  between  A1-P1  and  judgments  of 
nasality.  For  most  of  ten  vowels,  A^  predominated 
over  PI  in  determining  nasality  perception,  but  the 
parameter  A1-P1  was  found  to  be  more  highly  cor¬ 
related  with  the  judgments  than  was  A^  alone. 
Since  the  locations  of  the  formants,  especially  the 
first  two  formants,  affect  PI,  the  parameter  was 
normalized  using  the  first  formant  frequency  FI  and 
the  second  formant  frequency  F2.  The  normaliza¬ 
tion  did  not  require  the  use  of  formant  bandwidths 
to  give  good  approximations.  The  magnitude  of  the 
correlation  coefficient  of  nasal  judgments  and  nor¬ 
malized  A1-P1  of  synthesized  words  is  0.77,  which 
corresponds  to  the  value  from  the  study  of  the 
speech  hy  normal-hearing  and  hearing-impaired 
children.  The  difference  between  the  amplitude  of 
the  first  formant  and  the  amplitude  of  the  extra 
peak,  A1-P1,  is  promising  in  indicating  the  degree 
of  nasalization.  This  allows  objective  assessment 
of  the  velopharyngeal  opening  and  thus  of  the 
amount  of  nasalization. 


1.4.6  Aiding  Dysarthric  Speakers 

The  intelligibility  of  the  speech  of  some  dysarthric 
individuals  is  often  reduced  relative  to  that  of 
normal  speakers,  making  if  difficult  for  listeners  to 
understand  them.  We  have  been  examining  the 
possibility  of  using  speec.'  recognition  devices  to 
augment  the  communication  abilities  of  these  indi¬ 
viduals.  In  particular,  we  are  attempting  to  deter¬ 
mine  whether,  with  proper  design  of  an  inventory  of 
utterances,  a  speech  recognition  device  can  be 
trained  to  recognize  these  utterances  even  though 
the  utterances  may  not  be  intelligible  to  listeners. 
In  the  process  of  working  toward  this  practical  goal, 
we  hope  to  contribute  to  the  development  of  proce¬ 
dures  for  evaluating  the  nature  of  the  various  defi¬ 
ciencies  that  characterize  the  articulatory  and 
acoustic  patterns  of  dysarthric  speakers. 

Our  initial  research  has  concentrated  on  the  speech 
patterns  of  one  individual,  with  some  preliminary 
evaluation  of  the  speech  of  two  other  speakers. 
The  data  we  have  collected  are  of  three  kinds:  (1) 
evaluation  of  the  intelligibility  of  the  speaker  using 
testing  procedures  that  are  used  by  other  investi¬ 
gators;  (2)  obtaining  measures  of  the  performance 
of  a  speech  recognition  device  (Dragon  Writer  - 
1000)  when  trained  and  tested  on  words  produced 
by  the  dysarthric  speaker  (the  same  words  that 
were  used  in  the  intelligibility  test);  (3)  detailed 
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acoustic  analysis  of  the  speaker's  utterances,  and 
use  of  these  acoustic  data  to  aid  in  the  interpreta¬ 
tion  of  the  results  in  (1)  and  (2). 

These  data  were  analyzed  to  determine  possible 
patterns  of  deviant  articulatory  control  for  the 
speaker.  Probably  the  most  consistent  pattern 
emerging  from  this  analysis  was  the  inability  to  form 
consonants  that  required  raising  of  the  tongue 
blade,  particularly  the  coronal  fricative  consonants. 
As  a  consequence  of  these  initial  experiments,  we 
plan  to  make  a  selection  of  words  for  this  speaker 
that  have  the  potential  of  being  identified  with  high 
accuracy  by  the  speech  recognizer. 

1 .5  Models  for  Lexical  Representation 
and  Lexical  Access 

In  the  model  for  lexical  access  that  we  are  devel¬ 
oping,  lexical  items  are  represented  in  terms  of 
sequences  of  segments,  each  of  which  is  described 
by  a  list  of  distinctive  features.  Acoustic  measure¬ 
ments  are  made  to  determine  selected  landmarks  in 
the  signal,  corresponding  roughly  to  segments. 
Based  on  the  type  of  landmark,  a  further  set  of 
acoustic  properties  is  measured  in  the  vicinity  of 
each  landmark  in  order  to  determine  the  features 
for  the  segment. 

During  the  past  year,  the  lexicon  has  been 
expanded  to  250  words.  Particular  attention  has 
been  directed  toward  developing  rules  for  labelling 
features  or  feature  groups  for  segments  that  are 
susceptible  to  modification  by  certain  contexts:  One 
aspect  of  the  lexicon  is  that  function  words  (such  as 
and,  the,  of)  are  represented  twice — once  in  a  well- 
articulated,  nonreduced  form,  and  once  in  a  form 
that  attempts  to  capture  (in  terms  of  modifiable  fea¬ 
tures)  the  various  reductions  that  the  word  can 
undergo. 

Progress  has  been  made  in  the  first  step  of  the  pro¬ 
cedure  for  estimating  the  segments  and  features 
from  the  speech  signal,  i.e.,  the  automatic  detection 
of  landmarks.  Initial  attention  has  been  given  to 
abrupt  consonantal  landmarks,  at  the  implosion  or 
release  of  [+consonantal,  -sonorant]  segments.  A 
first  hypothesis  concerning  these  landmarks  is 
obtained  by  examining  the  changes  in  energy  in  a 
low-frequency  band  (0-1  kHz)  smoothed  with  a 
20-ms  averaging  time.  The  measure  of  change  is  a 
first  difference  over  a  50-ms  time  interval.  This  pro¬ 
cedure  detects  almost  all  the  desired  landmarks, 
but  also  registers  landmarks  in  the  vicinity  of  some 
glides.  Modifications  of  the  algorithm  are  being 
made  to  improve  the  performance.  These  modifica¬ 
tions  examine  changes  in  energy  in  higher  fre¬ 


quency  bands  as  well  as  changes  in  formant 
frequencies. 

Some  initial  progress  has  been  made  toward  the 
development  of  procedures  for  estimating 
articulator-bound  features,  based  on  measurements 
of  the  acoustic  pattern  in  the  vicinity  of  the  land¬ 
marks.  For  each  of  the  features  and  for  each  type 
of  landmark,  we  have  prepared  a  preliminary  list  of 
the  acoustic  properties  that  contribute  to  identifica¬ 
tion  of  the  feature.  This  list  of  properties  is  derived 
from  analysis  of  speech-production  models  that 
predict  the  acoustic  output  of  the  vocal  tract  when 
the  articulatory  movements  are  given,  as  well  as 
from  analysis  of  natural  speech.  For  example,  the 
properties  that  are  used  to  determine  which  artic¬ 
ulator  produced  a  stop  consonant  release  include 
the  burst  spectrum,  starting  frequencies  of  certain 
formant  transitions,  frequencies  of  the  formants  in 
the  adjacent  vowel,  and  time  course  of  the  formant 
movements. 

We  are  pioceeding  with  the  implementation  of 
some  of  these  procedures  for  estimating  the  fea¬ 
tures  from  acoustic  measurements  in  the  vicinity  of 
the  landmarks. 


1.6  Studies  of  the  Acoustics, 

Perception,  and  Modeling  of  Speech 
Sounds 

Studies  of  the  acoustics  and  modeling  of  speech 
sounds  have  continued  in  four  different  areas:  (1) 
examining  the  changing  acoustic  pattern  in  the 
vicinity  of  the  release  of  fricative  consonants:  (2) 
measuring  and  accounting  for  rapid  changes  in 
spectrum  and  in  amplitude  at  the  times  of  formation 
and  release  of  the  tongue-blade  constriction  for 
lateral  consonants:  (3)  describing  the  detailed 
acoustic  events  at  the  release  of  stop  consonants 
that  follow  the  sound  /s/  in  initial  consonant  clus¬ 
ters,  coupled  with  the  collection  of  data  on  jaw 
movements  during  production  of  the  consonants: 
(4)  developing  quantitative  acoustic  analysis  tech¬ 
niques  to  estimate  the  amount  and  time  course  of 
nasalization  in  a  vowel.  To  aid  in  our  modeling  of 
speech  sound  production,  we  have  completed  a 
computer-based  facility  for  estimating  the  transfer 
function  for  a  multitube  model  of  the  vocal  tract  with 
two  or  more  parallel  paths.  In  all  of  these  studies, 
we  are  attempting  to  refine  our  models  of  pro¬ 
duction  of  the  different  classes  of  sounds.  These 
models  make  certain  predictions  concerning  the 
time-varying  acoustic  characteristics  of  the  radiated 
sound,  given  the  articulatory  shapes  and  move¬ 
ments  that  are  used  to  generate  the  sounds.  Com¬ 
parisons  of  the  measured  acoustic  properties  with 
those  predicted  from  the  models  are  then  used  to 
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revise  the  estimates  of  the  articulatory  movements 
so  that  agreement  between  measurements  and 
model  outputs  is  achieved. 


1.7  Development  of  Computer  and  Data 
Processing  Facilities 

A  program  has  been  written  to  display  high-quality 
spectrograms  on  our  engineering  workstations.  The 
spectrogram  display  includes  a  cursor  which  can  be 
time-aligned  with  cursors  in  MITSYN-based  pro¬ 
grams  for  interactive  data  extraction  or  in  the  Klatt 
software  used  for  acoustic  analyses.  This  display 
will  facilitate  the  identification  of  abrupt  spectral 
transitions  in  acoustic  signals. 

The  KLSYN88  speech  synthesis  facility  has  been 
augmented  to  include  a  transient  source,  consisting 
of  a  single  sample,  which  can  be  filtered  in  the 
same  way  as  the  frication  source.  This  new  source 
is  used  to  simulate  the  transients  that  occur  at  the 
release  of  stop  or  affricate  consonants. 
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1.1  Introduction 

The  Sensory  Communication  Group  is  conducting 
research  on  (1)  the  auditory  and  tactual  senses,  (2) 
auditory,  visual,  and  tactual  aids  for  individuals  who 
are  hearing-impaired  or  deaf,  and  (3)  human- 
machine  interfaces  for  teieoperator  and  virtual- 
environment  systems  (involving  the  visual  as  well 
as  the  auditory  and  tactual  senses).  Within  the 
domain  of  hearing  aids,  research  is  being  con¬ 
ducted  on  systems  that  bypass  the  outer  and 
middle  ear  and  directly  stimulate  the  auditory  nerve 
electrically  (cochlear  prostheses),  as  well  as  on 
systems  that  stimulate  the  ears  acoustically.  The 
research  on  taction  is  focused  not  only  on  speech 
reception  for  the  totally  deaf,  but  also  on  the  ability 
of  the  human  hand  to  sense  and  manipulate  the 
environment.  Within  the  domain  of  human-machine 
interfaces,  topics  of  special  interest  concern  (1) 
development  of  principles  for  mapping  the  human 
sensorimotor  system  into  non-anthropomorphic 
slave  mechanisms  (or  the  equivalent  in  virtual 
space)  and  (2)  ability  of  the  human  sensorimotor 
system  to  adapt  to  alterations  of  normal 
sensorimotor  loops  caused  by  the  presence  of  the 
interface. 
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During  the  past  year,  our  research  has  focused  on 
(1)  linear  amplification,  (2)  the  effects  of  token  vari¬ 
ability,  (3)  a  computational  model  of  speech  intelli¬ 
gibility,  and  (4)  aids  to  speechreading. 
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1 .2.1  Linear  Amplification 

Adaptive  Control  of  Gain 

In  our  earlier  work,  we  examined  the  potential  of 
adaptive  filtering  for  improving  speech  reception  in 
noise.  We  found  a  good  association  between 
upward  spread-of-masking  caused  by  high-intensity 
octave  bands  of  noise  and  speech  intelligibility 
scores  when  test  conditions  were  characterized  by 
the  articulation  index  (Al).’  These  studies  support 
the  use  of  the  Al  for  estimating  the  effects  of  noise 
interference  on  speech  reception  both  for  normal- 
hearing  and  hearing-impaired  groups  of  listeners. 
Given  these  positive  results,  the  Al  model  appears 
to  provide  a  means  for  defining  a  frequency-gain 
characteristic  that  will  maximize  intelligibility  for  any 
given  case  of  interfering  background  noise. 

During  the  past  year,  we  have  taken  a  closer  look 
at  two  potential  limitations  to  applying  the  Al  model 
to  the  clinical  problem  of  selecting  hearing  aid 
frequency-gain  characteristics  for  listening  in  noise. 
These  limitations  are  excess  masking  demonstrated 
by  hearing-impaired  subects  and  individual  subject 
differences.  With  this  in  mind,  we  are  now  con¬ 
ducting  a  detailed  examination  of  the  extensive  sets 
of  narrowband  noise  masking  patterns  that  resulted 
from  our  earlier  studies. 

Our  masking  pattern  corpus  includes  data  obtained 
from  five  normal-hearing  subjects  and  seven 
hearing-impaired  subjects.  Masking  patterns  con¬ 
sisted  of  pure-tone  thresholds  measured  every  one- 
third-octave  in  the  presence  of  octave  bands  of 
noise  centered  on  0.5,  1.0,  or  2.0  kHz  and  were 
obtained  as  a  function  of  masker  intensity  (70  to  95 
dB  SPL  in  5-dB  steps).  We  have  compared  the 
hearing-impaired  subjects'  masking  patterns  with 
the  averaged  patterns  of  the  normal-hearing  sub¬ 
jects.  We  found:  (1)  a  low-frequency  spread-of- 
masking  for  both  normal-hearing  and 
hearing-impaired  subjects  that  was  roughly  constant 


for  the  masker  levels  used  here;  (2)  shallower 
slopes  on  the  high-frequency  side  of  masking  pat¬ 
terns  for  impaired  listeners:  and  (3)  for  some 
impaired  listeners,  substantially  greater  masking 
within  the  masker  band.  In  addition,  our  results 
show  that  decreasing  the  masker's  intensity  did  not 
decrease  upward  spread  of  masking  as  rapidly  for 
the  hearing-impaired  subjects  as  it  did  for  normal¬ 
hearing  subjects.  These  results  provide  an  explan¬ 
ation  for  the  smaller  improvements  in  speech 
reception  in  noise  demonstrated  by  some  hearing- 
impaired  subjects  following  attenuation  of  the  fre¬ 
quency  region  containing  a  band  of  masking  noise.^ 

We  have  begun  to  compare  individual  subject 
masking  patterns  to  those  predicted  from  algorithms 
such  as  that  described  by  Ludvigsen,^  which  esti¬ 
mates  masked  thresholds  based  on  the  noise  spec¬ 
trum  and  audiogram.  So  far,  we  have  found  some 
cases  in  which  Ludvigsen's  model  underestimates 
masking  substantially,  and  we  are  exploring 
whether  these  deviations  will  cause  substantial 
error  in  prediction  of  benefit  to  speech  reception. 
This  type  of  verification  is  possible  only  because  we 
have  both  the  masking  patterns  and  the  speech 
intelligibility  for  each  listening  condition.  As  part  of 
this  project,  we  plan  to  conduct  a  similar  compar¬ 
ison  with  the  recent  masking  model  proposed  by 
Humes  et  al.* 

Another  area  of  research  has  been  the  utilization  of 
the  Al  to  evaluate  the  effectiveness  of  hearing  aid 
gain-assignment  schemes  which  we  have  extended 
to  include  background  noise.  We  calculated  Als  for 
speech  in  speech-shaped  noise  and  for  speech  in  a 
high-intensity,  low-frequency  octave  band  noise,  fol¬ 
lowing  the  application  of  frequency-gain  character¬ 
istics  assigned  by  several  popular  gain-assignment 
prescriptions  for  more  than  30  different  audiograms. 
Results  support  the  contention  that  varying  the 
frequency-gain  characteristic  as  a  function  of  back¬ 
ground  noise  is  required  in  order  to  maximize 
speech  spectrum  audibility. 


'  C.M.  Rankovic,  P.M.  Zurek,  and  R.L.  Freyman,  "Potential  Benefits  of  Varying  the  Frequency-Gain  Characteristic  for  Speech  Recep¬ 
tion  in  Noise  for  Hearing-impaired  Individuals,"  J.  Acousl.  Soc.  Am.  90(Pt.2);  2319  (1991):  C.M.  Rankovic,  R.L  Freyman,  and  P.M. 
Zurek,  "Potential  Benefits  of  Varying  the  Frequency-Gain  Characteristic  for  Speech  Reception  in  Noise,"  J.  Acoust.  Soc.  Am.  91: 
354-362  (1992). 

2  C  M  Rankovic,  P.M.  Zurek,  and  R.L.  Freyman,  "Potential  Benefits  of  Varying  the  Frequency-Gain  Characteristic  for  Speech  Recep¬ 
tion  in  Noise  for  Hearing-impaired  Individuals,"  J.  Acoust.  Soc.  Am.  90(Pt.2):  2319  (1991):  J.N.  van  Dijkhuizen,  J.M.  Fasten,  and  R. 
Plomp,  "The  Effect  of  Frequency-Selective  Attenuation  on  the  Speech-Reception  Threshold  of  Sentences  in  Conditions  of  Low- 
frequency  Noise,"  J.  Acousl.  Soc.  Am.  90:  885-894  (1991). 

3  C.  Ludvigsen,  "Relations  Among  Some  Psychoacoustic  Parameters  in  Normal  and  Cochlearly  Impaired  Listeners,"  J.  Acoust.  Soc. 
Am.  78:  1271-1280  (1985). 

*  L.E.  Humes,  B.  Espinoza-Varas,  and  C.S.  Watson,  "Modeling  Sensorineural  Hearing  Loss.  I.  Model  and  Restrospective  Evaluation," 
J.  Acousl.  Soc.  Am.  83:  188-202  (1988). 
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Real-time  Multiband  Adaptive  Gain  Control 

The  development  of  multiband  adaptive  gain-control 
algorithms  is  being  pursued  for  two  reasons.  First, 
there  is  the  clear  desire  for  an  automatic  volume 
control  in  hearing  aids  that  would  obviate  the  need 
for  the  user  to  constantly  adjust  the  aid  manually  for 
different  acoustic  environments.  Multiband  control  is 
being  studied  because  the  desired  compensation  is 
often  frequency  dependent  (in  the  case,  for 
example,  of  head  shadow).  Second,  our  work®  and 
that  of  others  (e.g.,  van  Dijkhuizen  et  al.«)  has  dem¬ 
onstrated  the  benefit  of  increased  speech  intelli¬ 
gibility  gained,  under  some  restricted  conditions, 
from  adjusting  the  frequency-gain  characteristic  to 
attenuate  spectrally-localized  interference.  A  well- 
designed  algorithm  should  be  capable  of  both  con¬ 
trolling  loudness  and,  when  possible,  improving 
speech  reception  by  reducing  the  upward  spread  of 
masking. 

This  study^  is  an  evaluation  of  two  gain-control 
algorithms,  relative  to  a  fixed  linear  system,  for 
improving  intelligibility  and  reducing  noisiness.  The 
fact  that  the  systems  under  study  are  time-varying 
has  important  implications  for  how  they  are  imple¬ 
mented  and  evaluated.  One  implication  is  that  a 
real-time  system  is  preferable  for  the  flexibility  it 
offers  in  introducing  transient  and  time-varying 
sources  of  interference.  However,  because  we  also 
seek  to  evaluate  the  systems  with  time-varying 
interference,  this  makes  traditional  intelligibility 
testing  impractical.  Such  intelligibility  tests  would 
also  not  assess  noisiness,  the  other  dimension  for 
which  the  systems  should  be  evaluated. 

In  light  of  these  considerations,  we  implemented  a 
real-time  filter  bank  (fourteen  filters  with  roughly 
third-octave  widths)  with  adaptive  gain  controlled 
independently  in  each  band.  A  variety  of  interfer¬ 
ence  sources,  both  transient,  time-varying,  and 
steady,  were  added  to  a  continuous  reading  (of  a 
book-on-tape)  and  presented  as  the  input  to  each 
of  the  systems.  Additionally,  the  input  level  of 
speech  in  quiet  was  varied.  Seven  subjects  with 
moderate  hearing  impairment  participated.  In  an 
initial  session,  listeners  adjusted  the  overall  level  of 


speech  that  had  been  spectrally  shaped  to  match 
their  threshold  curves.  These  adjustments  served 
to  determine  the  gain  in  the  fixed  linear  system,  as 
well  as  the  target  output  band  levels  in  the  adaptive 
systems.  In  the  evaluations,  subjects  rated  the 
outputs  of  the  systems  according  to  both  intelli¬ 
gibility  of  the  speech  and  overall  noisiness  on 
scales  from  0  to  100. 

The  first  of  the  two  adaptive-gain  systems  is  called 
Multiband  Automatic  Volume  Control  (MAVC) 
because  it  is  designed  to  maintain  the  output  of 
each  band  at  a  fixed  level,  with  a  limit  on  maximum 
gain.  Thus,  the  static  input-output  characteristic  has 
a  fixed  gain  to  a  kneepoint  and  is  horizontal  above 
that  point.  The  second  system,  the  Noise  Control 
system,  is  similar  to  the  MBAVC  System,  except 
instead  of  using  fixed  band  output  target  levels,  its 
band  targets  depend  on  the  signal-to-noise  ratios, 
where  band  SNRs  were  estimated  using  an  algo¬ 
rithm  described  by  Festen  et  al.®  The  idea  here  is  to 
amplify  less  when  the  band  is  dominated  by  noise 
than  when  it  is  a  clear  speech  signal.  In  both 
systems,  the  level  estimation  needed  for  controlling 
gain  is  done  with  a  simple  RC  envelope  detector 
with  a  one-second  time  constant. 

Generally,  results  showed  the  expected  effects  of 
gain  control,  most  of  which  are  beneficial.  The 
average  intelligibility  ratings  for  the  gain-control 
systems  were  significantly  higher  than  for  the  fixed 
system  when  speech  (in  quiet)  is  presented  below 
the  average  level.  Also,  when  the  input  was  only 
noise  at  a  relatively  low  level,  the  gain-control 
systems  were  rated  to  be  noisier  than  the  fixed 
system.  When  the  input  noise  level  was  greater 
than  the  expected  speech  level,  the  fixed  system 
amplified  the  noise  above  the  target  level,  while  the 
gain-control  systems,  in  the  steady  state,  presented 
the  noise  at  or  below  the  target  level. 

Most  of  the  noisiness  ratings  for  the  two  gain- 
control  systems  were  lower  than  those  for  the  fixed 
system  when  the  environmental  noises  lasted  long 
enough  for  the  gain-control  systems  to  react.  The 
Noise  Control  System  was  rated  the  least  noisy 
system  in  most  of  the  high  noise  level  conditions. 


5  C.M.  Rankovic,  R.L.  Freyman,  and  P.M.  Zurek,  "Potential  Benefits  of  Varying  the  Frequency-Gain  Characteristic  for  Speech  Recep¬ 
tion  in  Noise,"  J.  Acoust.  Soc.  Am.  91;  354-362  (1992). 

6  J.N.  van  Dijkhuizen,  J.M,  Festen,  and  R.  Plomp,  "The  Effect  of  Frequency-Selective  Attenuation  on  the  Speech-Reception  Threshold 
of  Sentences  in  Conditions  of  Low-frequency  Noise,"  J.  Acoust.  Soc.  Am.  90:  885-894  (1991). 

r  G.R.  Martin,  Studies  of  Real-Time  Multiband  Adaptive  Gain  Hearing  Aids,  S.M.  thesis.  Dept,  of  Electr.  Eng.  and  Comput.  Sci.,  MIT, 
1992. 

8  J.M.  Festen,  J.M.  van  Dijkhuizen,  and  R.  Plomp,  "Considerations  on  Adaptive  Gain  and  Frequency  Response  in  Hearing  Aids,"  Acta 
Otoiaryn.  Suppl.  469: 196-201  (1990). 
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showing  the  effectiveness  of  SNR-dependent  gain 
control. 

Intelligibility  enhancements  with  gain  control  are 
expected  for  noise  sources  whose  average  spec¬ 
trum  is  different  from  the  speech  spectrum.  The 
gain-control  systems  react  to  such  noise  sources  by 
decreasing  the  gain  more  in  the  bands  where  the 
noise  is  strong,  thereby  reducing  spread  of 
masking.  The  results  indicated  that  this  expectation 
was  realized.  There  was  no  significant  intelligibility 
enhancement  or  degradation  when  speech  plus 
speech-shaped  noise  was  tested.  With  other 
noises  that  are  not  shaped  like  speech,  intelligibility 
ratings  were  better  with  gain  control  in  most 
instances.  Those  noises  whose  spectra  were  the 
least  shaped  like  speech  resulted  in  the  largest 
increases.  Generally,  intelligibility  ratings  of  the 
MBAVC  and  the  Noise  Control  systems  were 
similar  under  most  conditions. 

These  results  show  that  there  is  promise  for  both 
the  Noise  Control  and  MBAVC  Systems  to  outper¬ 
form  the  fixed  system  with  respect  to  the  intelli¬ 
gibility  of  speech  and  perceived  noisiness  of  slowly 
varying  background  noises  that  are  not  spectrally 
shaped  like  speech.  The  results  also  show  that 
there  is  promise  for  the  Noise  Control  System  to 
outperform  the  MBAVC  system  with  respect  to  the 
perceived  noisiness  of  slowly  varying  background 
noises  that  are  not  shaped  like  speech,  while  at  the 
same  time,  not  decreasing  the  intelligibility  of  the 
speech. 

One  drawback  of  gain-control  systems  is  the  time 
needed  to  react  to  changing  sound  levels  (a  time 
constant  of  one  second  was  used).  For  instance, 
during  periods  of  quiet,  the  gain  for  these  systems 
would  be  large,  and,  if  a  noise  source  suddenly 
occurs,  it  will  be  quite  loud  for  a  second  or  so.  To 
address  this  problem,  we  are  planning  to  implement 
a  scheme  like  Moore's  "dual-front  end"  for  quickly 
suppressing  strong  onsets  (and  releasing  control 
quickly  if  the  sound  is  a  transient).® 


Feedback  Cancellation 

Several  signal  processing  techniques  for  reducing 
feedback  have  been  proposed  recently.  Thus  far, 
however,  evaluations  of  these  methods  have  been 
limited  to  measurements  of  the  increase  in  stable 
gain  that  can  be  achieved.  Furthermore,  the  condi¬ 
tions  under  which  these  measurements  have  been 
made  are  very  different  from  one  study  to  another. 
In  this  work,  our  initial  goal  was  to  implement  the 
most  promising  of  the  existing  techniques  and 
compare  them  not  only  for  stable  gain  but  also  for 
their  effects  on  sound  quality  and  annoyance.  In  the 
process  of  studying  these  systems,  a  novel  proce¬ 
dure  was  developed,  and  this  is  also  included  for 
comparison. 

The  five  feedback-reduction  algorithms  being  exam¬ 
ined  are  an  adaptive  notch  filter  and  four  variants  of 
adaptive  cancellation.  The  adaptive  notch  method 
had  been  previously  proposed  for  use  in  public- 
address  systems.  This  system  has  not  been  evalu¬ 
ated  in  applications  to  hearing  aids,  though  it  has 
been  used  in  an  algorithm  for  detecting  oscillation.’® 

The  adaptive  cancellation  methods  have  a  common 
goal  of  estimating  feedback  path  in  order  to  cancel 
it  with  another  out-of-phase  feedback  path.  The 
primary  difference  among  methods  is  in  the  way 
that  the  estimation  is  performed.  In  the  algorithm 
described  by  Bustamante  et  al.,”  a  delayed  version 
of  the  output  signal  of  the  hearing  aid  is  correlated 
against  the  microphone  signal.  The  system  of 
Engebretson  et  al.’®  is  the  same  but  with  the  inten¬ 
tional  addition  of  noise  to  the  output  as  a  probe 
signal.  In  both  of  these  systems,  adaptation  takes 
place  continuously.  Kates’  approach  differs  in  that 
the  system  waits  for  oscillation  to  occur,  then  inter¬ 
rupts  the  main  signal  path  and  inserts  a  probe 
noise  for  feedback-path  estimation.  In  the  novel 
system  developed  as  part  of  this  work,  the  esti¬ 
mation  is  performed  by  inserting  a  probe  i.oise 
during  selected  time  intervals  when  the  input  signal 
is  estimated  to  be  low.  This  approach  has  three 
advantages  over  Kates:’®  (1)  it  provides  a  quasi- 
continuous  update  of  the  estimation,  not  simply 
waiting  for  oscillation  to  occur;  (2)  it  achieve?  the 
update  without  losing  large  segments  of  the  signal; 


9  B.C.J.  Moore,  "How  Much  Do  We  Gain  by  Gain  Control  in  Hearing  Aids,"  Acta  Ololaryn.  Suppi  469;  250-256  (1990). 
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and  (3)  because  the  input  signal  interferes  with  the 
estimation  of  the  feedback  path,  it  provides  a  better 
estimate  by  adapting  only  during  intervals  when  the 
input  signal  is  small. 

All  five  methods  have  been  implemented  with  a 
Motorola  DSP96002  chip  and  are  currently  being 
evaluated  for  increased  stable  gain,  speech  quality, 
and  user  annoyance.  Real-time  implementation  is 
necessary  for  assessing  the  performance  of  the 
algorithm  under  dynamic  conditions.  Increased 
stable  gain  is  measured  physically,  while  sound 
quality  and  annoyance  are  being  determined  by 
subjective  listening  tests.  For  system  evaluation, 
the  acoustic  feedback  path  is  simulated  with  an 
electrical  feedback  path.  The  electrical  feedback 
path  provides  more  control  over  the  experimental 
variables,  a  necessity  when  comparing  performance 
of  the  feedback  suppression  algorithms.  This  path 
contains  a  flexible  multifilter  for  shaping  the  desired 
transfer  function.  Dynamic  simulation  is  accom¬ 
plished  by  controlled  changes  in  the  feedback  path. 

We  are  just  beginning  the  evaluation  process.  It 
has  already  become  evident,  however,  that  the 
continuous-adaptation  schemes'^  are  prone  to 
diverge,  even  in  the  absence  of  feedback. 
Inspection  of  these  systems  shows  that  they  are 
adaptive  recursive  filters,  for  which  special  precau¬ 
tions  must  be  taken  to  assure  convergence.  Both 
of  the  other  adaptive  schemes  allow  adaptation  only 
during  times  when  the  feedback  loop  is  opened, 
and  thereby  avoid  this  problem. 


1.2.2  Speech  Token  Variability 

This  research  consists  of  three  components:  (1) 
characterizing  the  variability  in  the  acoustic  proper¬ 
ties  of  speech  sounds,  (2)  measuring  the  perceptual 
effects  of  token  variability  on  intelligibility  in  normal 
and  hearing  impaired  listeners,  and  (3)  developing 
a  model  that  relates  acoustical  and  perceptual 
effects.  Progress  in  previous  years  generally  was 
made  in  the  first  component,  i.e.,  measurement  of 
the  physical  variability  in  vowels  and  fricatives. 
Recently,  we  have  made  progress  on  the  second 
and  third  components. 

The  stimuli  used  in  all  the  perceptual  experiments 
are  repetitions  of  vowels  in  an  /h/-V-/d/  context  by 


the  same  speaker.  The  /h/-V-/d/  syllables  were  dig¬ 
itally  distorted  in  a  manner  described  by  Schroeder 
(1968).  Three  new  experiments  have  been  per¬ 
formed.''*  The  first  experiment  examined  the  ability 
of  trained  listeners  to  identify  different  tokens  of  the 
same  vowel  sound  as  the  number  of  utterances  or 
tokens  was  varied.  For  tokens  of  the  vowel  /i/, 
identification  scores  declined  from  85  percent  for 
four  tokens,  to  64  percent  for  eight  tokens,  and  50 
percent  for  16  tokens.  This  suggests  that  the  per¬ 
ceptual  differences  between  tokens  are  not  suffi¬ 
ciently  salient  to  permit  reliable  identification.  The 
second  experiment  investigated  the  extent  to  which 
variability  in  the  multiplicative  distortion  of  the 
stimuli  affected  discriminability.  The  results  show 
that  for  the  vowel  pair  /i/-/u/,  effects  of  the  distortion 
are  minimal  compared  to  the  effects  of  token  vari¬ 
ability.  For  the  vowel  pair  eh-uh,  on  the  other  hand, 
effects  of  both  token  and  distortion  variation  are 
comparable. 

In  the  third  experiment,  vowel  discrimination  was 
measured  for  the  vowel  pair  ae-ah  as  a  function  of 
the  token  composition  of  the  stimulus  set.  In  this 
experiment,  eight  tokens  were  used  to  represent 
each  vowel  sound.  In  general,  the  relative 
discriminability  of  each  vowel  sound  was  not 
strongly  affected  by  the  composition  of  the  stimulus 
set.  This  permits  the  relative  discriminability  to  be 
correlated  with  the  physical  characteristics  of  the 
stimuli  that  were  made  earlier.  Linear  combinations 
of  the  physical  parameters  computed  by  Multiple 
Regression  Analysis  were  generally  able  to  predict 
more  than  60  percent  of  the  variance  in  the  percep¬ 
tual  variables. 


1.2.3  Prediction  of  Speech  Inteiligibiiity 

This  project  involves  development  and  evaluation  of 
computational  methods  for  determining  the  intelli¬ 
gibility  of  speech  subjected  to  a  waveform  degrada¬ 
tion  or  signal-processing  transformation.  The 
methods  employ  a  model  of  the  human  auditory 
system  which  reduces  a  speech  waveform  to  a 
sequence  of  discrete  symbols,  each  representing  a 
prototypical  vector  of  parameter  values  measured  in 
a  single  frame  (e.g.,  10  ms)  of  speech.  The  percep¬ 
tual  effect  of  the  transformation  is  estimated  by 
measuring  the  consistency  between  the  symbol 
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sequence  derived  from  an  untransformed  (input) 
speech  signal  and  the  symbol  sequence  derived 
from  a  transformed  (output)  signal.  This  estimate  is 
implemented  via  calculation  of  percent  transmitted 
information  (%TI). 

Previous  research  used  the  energy  in  each  of  20 
critical  bands  as  the  source  of  the  parameter 
vector.  During  the  past  year,  we  have  focused 
instead  on  a  different  set  of  20  parameters,  derived 
from  the  Ensemble  Interval  Histogram  (EIH)  model 
developed  by  Ghitza’®  at  AT&T  Bell  Laboratories. 
The  central  idea  of  this  model  is  that  speech  infor¬ 
mation  is  represented  in  the  lengths  of  intervals 
between  crossings  of  certain  waveform-amplitude 
values.  Implementations  of  this  model  (e.g.,  Ghitza, 
1992;  Jankowski,  1992)'®  are  largely  comprised  of 
two  major  stages:  (1)  filtering  speech  into  a  number 
of  bands,  and  (2)  computing  crossing  intervals  for 
the  individual  band  waveforms.  In  the  current  study, 
a  primary  goal  was  to  keep  the  implementation  as 
similar  as  possible  to  that  used  for  computing  the 
band-energy  parameter  set.  In  particular,  the  new 
computation  maintained  the  frequency  region 
spanned  by  the  bandpass  filters,  number  of 
bandpass  filters,  frame  rate  of  the  short-time 
speech  analysis,  and  total  number  of  parameters 
extracted. 

The  %TI  values  obtained  via  this  model  can  be 
compared  to  those  obtained  via  the  band-energy 
computation  for  each  of  three  types  of  degradation; 
additive  noise,  highpass  filtering,  and  lowpass  fil¬ 
tering.  Results  for  the  two  parameter  sets  were 
quite  similar  in  the  ease  of  noise.  At  low  SNRs,  %TI 
values  for  EIH  were  higher  than  those  for  band- 
energy  by  about  2  percentage  points.  At  high 
SNRs,  %TI  values  for  EIH  were  lower  than  those 
for  band-energy  by  about  4  percentage  points.  The 
crossover  point  at  which  the  two  parameter  sets 
gave  equal  results  was  approximately  +10  dB  SNR. 
Since  the  EIH  model  seemed  slightly  more  resistant 
to  noise  than  the  band-energy  model,  results  were 
also  calculated  for  negative  SNRs,  which  had  not 
been  previously  considered.  These  were  much 
lower  than  those  for  the  positive  SNRs,  with  %TI 
values  ranging  from  17.2  at  -5  dB  SNR  to  3.6  at  -20 
dB  SNR. 

For  the  highpass-filtering  degradation,  %TI  values 
for  EIH  were  much  lower  than  for  band-energy.  The 


average  difference  was  about  17  percentage  points: 
this  was  fairly  uniform  across  cutoff  frequencies. 
For  lowpass  filtering,  the  %TI  values  were  again 
lower  for  EIH,  but  with  a  smaller  discrepancy  aver¬ 
aging  about  9  percentage  points.  Differences  were 
greater  at  higher  cutoff  frequencies  than  at  lower 
ones.  In  fact,  at  the  500-Hz  cutoff  frequency,  the 
%TI  value  for  EIH  was  the  higher  one  (but  by  less 
than  one  percentage  point).  Because  of  these  dif¬ 
ferences  in  the  highpass  and  lowpass  conditions, 
the  crossover  frequency  calculated  from  the  EIH 
results  was  much  lower  than  that  calculated  from 
the  band-energy  results:  1000  Hz  vs  1750  Hz. 

Across  all  conditions  tested,  %TI  results  for  the  EIH 
model  were  usually  lower  than  those  for  the  band- 
energy  model.  This  suggests  that  degradations  gen¬ 
erally  affected  the  consistency  of  level-crossing 
information  more  than  they  affected  the  consistency 
of  band-energy  information.  The  absolute  numerical 
values  of  %TI  are  of  greatest  interest  in  the 
low-SNR  noise  conditions.  In  particular,  it  would  be 
useful  if  these  values  provided  an  adequate  charac¬ 
terization  of  intelligibility  across  all  noise  levels  for 
which  humans  retain  some  reasonable  speech- 
reception  ability.  However,  the  -20  dB  SNR  result, 
3.6  %TI,  is  comparable  to  the  2.0  %TI  obtained 
from  randomly  generated  symbol  sequences  of  the 
length  used  in  these  calculations.  Thus,  consider¬ 
ably  more  speech  data  would  need  to  be  processed 
for  reliable  %TI  results  in  intense  noise  degrada¬ 
tions. 

However,  absolute  %TI  values  are  not  the  only  data 
of  interest:  instead,  the  relationship  between  %TI 
values  for  noise  and  filtering  is  also  important.  In 
particular,  one  characteristic  of  the  result  pattern  for 
the  band-energy  model  is  that  noise  degradations 
of  little  perceptual  significance  reduced  %TI  as 
much  as  did  filtering  degradations  that  have  a 
somewhat  larger  perceptual  effect.  In  going  from 
the  band-energy  model  to  the  EIH  model,  the  %TI 
values  for  filtering  dropped  more  than  did  the  %TI 
values  for  noise,  therefore  leading  to  a  result 
pattern  that  is  more  perceptually  realistic. 

One  final  observation  of  the  EIH  results  is  the  pres¬ 
ence  of  particularly  low  %TI  values  for  highpass  fil¬ 
tering.  One  possible  explanation  for  this  can  be 
found  in  the  selection  of  the  amplitude  levels  for 
which  crossings  are  located.  These  levels  span  the 
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amplitude  range  found  in  wideband  speech. 
Highpass-filtered  speech  is  considerably  attenuated 
and  rarely  crosses  the  higher  levels.  This  leads  to 
the  EIH  parameters  providing  a  poorer  represen¬ 
tation  of  the  speech  information.  Also,  independent 
of  these  characteristics  of  the  results,  another  dis¬ 
advantage  to  use  of  the  EIH  model  is  that  its  com¬ 
putation  time  is  approximately  ten  times  that  of  the 
band-energy  model. 

In  addition  to  work  on  the  EIH  model,  some  supple¬ 
mentary  computations  were  executed  during  the 
past  year  to  address  the  issue  of  why  very  slight 
degradations  result  in  %TI  values  that  are  consider¬ 
ably  less  than  100.  Of  these,  one  that  seemed 
insightful  had  a  complete  absence  of  degradation  at 
the  waveform  level,  and  instead  perturbed  the  cal¬ 
culation  of  the  symbol  sequence  by  use  of  a  dif¬ 
ferent  initial  guess  in  the  vector-quantization  stage. 
These  experiments  helped  to  confirm  the  hypoth¬ 
esis  that  a  substantial  drop  in  %TI  is  caused  solely 
by  reassignment  of  prototype  vectors.  In  some 
cases  of  particularly  mild  degradation,  it  can  be 
beneficial  to  omit  this  reassignment  and  simply  use 
the  prototype  vectors  from  undegraded  speech. 

Finally,  there  were  some  calculations  performed  to 
validate  portions  of  the  experimental  methods. 
Issues  of  interest  included  whether  the  %TI  results 
were  strongly  dependent  on  the  particular  VQ  initial¬ 
ization  method,  whether  accounting  for  waveform 
delays  due  to  filtering  (either  for  degradation  pur¬ 
poses  or  within  a  filter  bank)  would  affect  the 
results,  and  whether  the  7oTI  measure  had  a  rea¬ 
sonable  relationship  to  percentage  of  exact  symbol 
matches  (i.e.,  a  percent-correct  measure).  None  of 
these  uncovered  any  significant  methodological  dif¬ 
ficulty. 


1.2.4  Aids  to  Speechreading 

During  the  past  grant  year,  our  research  on  aids  to 
speechreading  has  continued  to  focus  on  (1)  evalu¬ 
ating  the  effectiveness  of  Cornett's  (1967)  Manual 
Cued  Speech  system  and  (2)  developing  low  band¬ 
width  acoustic  signals  that  enhance 
speechreading. 


Cued  Speech 

The  system  of  Manual  Cued  Speech  was  devel¬ 
oped  by  Cornett  (1967)  as  a  supplement  for 
speechreading  that  could  be  taught  to  very  young 
deaf  children  and  used  in  day-to-day  communi¬ 
cation.  In  Cued  Speech,  eight  visually  distinct  hand 
shapes  are  used  to  distinguish  between  consonants 
that  are  not  well  resolved  via  speechreading  and 
four  hand  positions  near  the  mouth  provide  dis¬ 
tinctions  within  groups  of  vowels  often  confused  in 
speechreading.  During  the  past  year,  we  completed 
our  initial  study  of  the  effectiveness  of  Cued 
Speech  and  developed  an  analysis  that  can  be 
used  to  estimate  the  effectiveness  of  automatic 
cueing  systems  based  on  automatic  speech 
recognizers.’® 

As  previously  reported,  we  have  documented  the 
sentence  reception  ability  of  four  highly-trained 
receivers  of  Manual  Cued  Speech.  The  availability 
of  cues  permitted  near  perfect  reception  of  high- 
context  everyday  sentences  spoken  at  near  normal 
speaking  rates,  and  very  high  scores  (roughly  85 
percent  of  the  words  were  correct)  were  obtained 
for  low-context  sentences.’®  In  control  tests  on  cued 
and  uncued  sentences,  selected  speechreaders 
who  had  no  previous  experience  with  Cued  Speech 
achieved  word  scores  in  the  range  of  24-36  percent 
on  everyday  sentences  and  15-22  percent  on  the 
low-context  sentences.  Although  the  levels  of 
speech  reception  achieved  by  the  trained  receivers 
is  impressive,  further  analysis  indicates  that  a  sig¬ 
nificant  fraction  (10-20  percent)  of  speech  segments 
were  not  correctly  perceived.  In  a  related  study,  we 
measured  how  well  spectrogram  readers  and  an 
ASR  system  could  assign  segments  to  cue  groups 
for  various  cue  systems.  Well-trained  spectrogram 
readers  were  able  to  assign  six  consonant  cues 
with  an  accuracy  of  85  percent  and  eleven  vowel 
cues  with  an  accuracy  of  62  percent;  an  expert 
reader  was  able  to  achieve  scores  of  94  percent 
and  83  percent  on  these  tasks.  An  automatic 
speech  recognizer  recently  developed  at  MIT 
achieved  accuracies  of  77  percent  and  65  percent 
on  comparable  tasks.  In  the  third  study,  we  applied 
the  Post-Labeling  Model  of  audiovisual  integration*® 
to  these  recognizer  measurements  and  data  on 
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human  recognition  of  consonant  and  vowel  seg¬ 
ments  via  speechreading  to  estimate  the  benefit  to 
speechreading  provided  by  such  cues.  The  analysis 
suggests  that  with  cues  derived  from  current 
recognizers,  consonant  and  vowel  segments  would 
be  received  with  accuracies  in  excess  of  80 
percent.  This  level  of  accuracy  is  roughly  sufficient 
to  account  for  the  scores  our  subjects  achieved  on 
M'  ,iual  Cued  Speech  reception.  Use  of  the  Post- 
‘  abeling  Model  provides  analytic  means  for  (1) 
determining  optimum  cue  groups  for  a  given 
recognizer  and  speechreader  and  (2)  estimating  the 
cueing  performance  that  might  be  achieved  if  the 
accuracy  of  current  recognizers  were  improved  in 
various  ways,  e.g.,  by  reducing  the  frequency  of 
voicing  errors. 

Based  on  these  results,  it  seems  appropriate  to 
begin  development  of  an  automatic  cueing  system 
based  on  an  automatic  speech  recognizer  and  a 
synthetic  visual  display  of  cues.  Major  uncertainties 
in  the  design  of  such  a  system  concern  the  effects 
of  recognizer  errors,  appropriate  choice  of  cue 
groups,  and  specification  of  visual  display  of  cue 
symbols.  We  are  planning  a  series  of  experiments 
to  evaluate  the  effects  of  recognizer  errors  on 
speech  reception  using  an  automatic  cueing 
system.  If  the  ASR  does  not  identify  a  phoneme 
correctly,  the-  displayed  cue  will  not  correspond  to 
the  spoken  phoneme  and  therefore  will  be  incorrect. 
Our  strategy  is  to  simulate  the  performance  of  auto¬ 
matic  cueing  systems  that  use  recognizers  with 
various  error  rates  and  patterns  of  error  structures. 
These  systems  will  use  the  traditional  cue  groups  of 
manual  cued  speech  so  that  highly  trained  cue 
receivers  can  evaluate  speech  reception. 

In  a  preliminary  experiment,  we  tested  the  percent 
of  phonemes  that  could  be  inaccurate  and  still  allow 
normal  listeners  to  identify  speech.  We  used  only 
auditory  stimuli,  not  the  cues  from  cued  speech  to 
give  a  baseline  for  further  experiments  with  cued 
speech.  The  stimuli  were  the  Clarke  sentences,*' 
with  portions  corresponding  to  selected  phonemes 
replaced  by  white  noise.  A  given  phoneme  was 
replaced  by  noise  in  the  same  proportion  that  it  was 
misidentified  by  a  known  speech  recognition  system 
(BBN  system).  If  two  percent  of  the  misidentifica- 
tions  that  the  speech  recognition  system  made 
were  /ah/'s  then  two  percent  of  the  phonemes 
replaced  would  be  /ah/’s.  There  were  four  condi¬ 


tions:  10,  20,  30,  and  40  percent  of  the  phonemes 
in  the  group  of  sentences  were  replaced  with  noise. 
Subjects  heard  the  sentences  and  were  instructed 
to  guess  what  a  word  was  even  if  they  were  not 
sure. 

We  calculated  two  measures  from  a  preliminary 
test.  First,  we  counted  the  percent  of  phonemes 
accurately  identified.  In  general,  a  higher  percent 
of  phonemes  were  correctly  identified  in  sentences 
with  a  lower  percent  of  phonemes  replaced  by 
noise.  Second,  we  categorized  the  guesses  sub¬ 
jects  made  by  the  number  of  phonemes  from  which 
they  differed  from  the  original  phoneme.  A  one 
phoneme  change  indicated  that  a  guess  was  either 
a  change  of  one  phoneme  to  another  phoneme  /p/ 
to  /b/,  for  example,  or  the  addition  or  deletion  of 
phoneme.  We  grouped  the  guesses  into  one,  two, 
three,  or  greater  than  three  phoneme  changes. 
The  results  showed  that  in  general  the  category  of 
the  substitution  varied  with  the  percent  of 
phonemes  replaced  by  noise.  In  sentences  with 
lower  percents  of  phonemes  replaced  by  noise, 
lower  phoneme  substitutions  were  more  prevalent 
than  higher  phoneme  substitutions.  Listeners  were 
more  likely  to  guess  a  word  which  was  acoustically 
closer  to  the  target  word.  Similarly,  in  sentences 
with  higher  percents  of  phonemes  replaced  by 
noise,  phoneme  changes  were  more  likely  to  be 
greater  than  three  phonemes  from  the  target  than  in 
sentences  with  lower  percents  of  phonemes 
replaced  by  noise.  Listeners  were  less  likely  to 
guess  a  word  which  was  acoustically  closer  to  the 
target  word,  and  more  likely  to  guess  a  word  which 
was  semantically  and  syntactically  appropriate. 

Frequency  Lowering  of  Amplitude 
Envelopes 

Acoustic  signals  that  retain  much  of  the  intelligibility 
of  the  unprocessed  speech  signal  can  be  synthe¬ 
sized  by  extracting  the  amplitude  envelopes  of  fil¬ 
tered  bands  of  speech  and  using  these  envelopes 
to  modulate  the  amplitudes  of  tones  with  frequen¬ 
cies  at  the  centers  of  the  bands.  These  signals 
successfully  supplement  speechreading  by  normal¬ 
hearing  listeners.**  However,  when  the  center  fre¬ 
quency  of  the  carrier  bands  is  lowered  to  match  the 
residual  hearing  of  listeners  with  severe  high- 
frequency  loss,  the  signals  no  longer  successfully 


2'  M  E.  Magner,  A  Speech  Intelligibility  Test  for  Deaf  Children  (Northanipluo,  Massachusetts:  Clarke  School  for  the  Deaf.  197?). 

22  M.  Breeuwer  and  R.  Plomp,  ’Speechreading  Supplemented  with  Frequency-Selective  Sound- Pressure  Information,"  J.  Acoust  Soc. 
Am.  76:  686-691  (1984);  K.W.  Grant.  L.D.  Braida,  and  R,J.  Renn,  "Auditory  Supplements  to  Speechreading:  Combining  Amplitude 
Envelope  Cues  From  Different  Spectral  Regions  of  Speech,"  J.  Acoust.  Soc.  Am.  92:  2345  (1992);  K.W.  Grant  and  L.D.  Braida. 
"Evaluating  the  Articulation  Index  for  Audiovisual  Input,"  J.  Acoust.  Soc.  Am.  89:  2952-2960  (1991). 
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supplement  speechreading.^^  ip  response  to  a 
report  by  Sheft  and  Yost^^  that  the  failure  of 
frequency-lowered  amplitude  envelopes  to  supple¬ 
ment  speechreading  may  arise  because  listeners 
are  unable  to  compare  the  amplitude  envelopes  of 
signals  across  changes  in  the  center  frequency  of 
the  signal,  Annie  Takeuchi  has  been  examining  the 
effect  of  systematic  training  on  the  ability  of  normal¬ 
hearing  listeners  to  match  amplitude  envelopes  of 
signals  at  different  center  frequencies. 

The  amplitude  envelopes  were  carried  by  narrow- 
band  noise  signals  50-Hz  wide,  centered  at  500, 
1600,  and  3160  Hz.  These  baseline  frequencies 
were  selected  because  they  are  the  center  frequen¬ 
cies  of  the  filtered  bands  of  speech  from  which 
amplitude  envelopes  have  been  extracted  to  sup¬ 
plement  speechreading.  Subjects  were  trained  on 
a  roving  XAB  matching-to-sample  task,  in  which  the 
center  frequencies  of  the  A  and  B  signals  were  sys¬ 
tematically  increased  or  decreased  in  increments  of 
10  percent  up  to  100  percent  and  down  to  -90 
percent  from  the  baseline  frequency,  while  the 
center  frequency  of  the  X  signal  was  fixed  at  the 
baseline  frequency. 

For  all  three  baseline  frequencies,  performance  as 
measured  by  d'  was  best  when  there  was  no  fre¬ 
quency  difference  between  the  X,  A,  and  B  signals, 
and  d'  generally  decreased  when  any  frequency  dif¬ 
ference  was  present.  However,  for  all  frequency 
differences  except  decreases  of  more  than  60 
percent  (-5.7  Barks)  from  the  1600-Hz  baseline  fre¬ 
quency,  d'  remained  well  above  1.76,  which  corre¬ 
sponds  to  70.7  percent  correct  unbiased  responding 
on  the  roving  XAB  task,  the  threshold  value  most 
commonly  measured  in  psychophysical  exper¬ 
iments.  For  the  500-  and  3160-Hz  baseline  condi¬ 
tions,  increases  and  decreases  in  frequency 
produced  similar  results.  For  the  1600-Hz  condi¬ 
tion,  increases  in  frequency  produced  much  smaller 
decrements  in  performance  relative  to  no  frequency 
difference  than  did  decreases  in  frequency.  This 
study  demonstrates  that  with  systematic  training, 
listeners  are  generally  able  to  compare  amplitude 
modulation  patterns  across  large  frequency  differ¬ 
ences. 


1.2.5  Human  Subjects 

There  have  been  no  changes  in  our  protocols 
during  the  past  year,  and  we  do  not  plan  any  for 
the  coming  year.  These  protocols  were  evaluated 
during  the  competitive  review  and  have  been 
renewed  for  the  coming  grant  year. 
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1.3  Multimicrophone  Hearing  Aids 

Sponsor 

National  Institutes  of  Health 
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The  long-term  goal  of  this  research  is  the  develop¬ 
ment  of  sensory  aids  that  improve,  through  the  use 
of  microphone  arrays,  the  ability  of  hearing-impaired 
listeners  to  function  in  complex  acoustic  environ¬ 
ments.  Since  the  reception  of  speech  is  the  most 
important  problem  for  the  hearing  impaired,  the 
target  signal  of  primary  interest  is  speech. 

To  enhance  monaural  speech  reception,  we 
envision  a  microphone  array  that  resolves  the 
incoming  signals  into  simultaneous  directional  chan¬ 
nels,  followed  by  a  coding  operation  that  transforms 
these  resolved  signals  in  such  a  way  that  resolution 
is  preserved  at  the  perceptual  level  after  the  signals 
are  summed  for  presentation  to  a  single  ear.^s  Such 
a  system  would  permit  the  monaural  listener  (like 
the  normal  binaural  listener)  to  monitor  all  directions 
simultaneously,  detect  and  localize  in  the  same 
operation,  and  focus  on  a  single  direction.  Our 
current  work  on  microphone  arrays  is  directed 
toward  the  creation  of  a  single  directional  channel 
containing  the  target  signal  (assumed  to  arise  from 
a  target  source  straight  ahead  of  the  listener)  and 
reduction  of  interference  from  sources  directionally 
Distinct  from  the  target  source.  Parallel  processing 
of  array  outputs  to  achieve  simultaneous  multiple 
directional  channels  will  be  considered  only  after 
further  progress  on  the  coding  problem  has  been 
achieved. 

While  development  of  adaptive  arrays  has  con¬ 
tinued,^^  considerable  effort  in  the  past  year  has 
gone  into  the  design  of  fixed  arrays. 


1.3.1  Fixed  Arrays  for  Hearing  Aids 

Microphone  arrays  with  fixed  (time-invariant) 
weights  are  directed  at  enhancing  a  desired  signal 
from  one  direction  (straight  ahead)  while  attenuating 
spatially  distributed  interference  and  reverberation. 
Using  the  theory  of  sensitivity-constrained  optimal 
beamforming,^^  free-field  arrays  of  head-sizea 
extents  were  studied.^®  The  key  parameters 
affecting  array  design  and  performance  are  the  set 
of  transfer  functions  from  the  target  direction  to 
each  array  microphone  H  (f)  and  the 
intermicrophone  cross-spectral  densities  for 
isotropic  noise  S„(f).  Design  variables  included  ori- 


25  N.l.  Durlach.  R.C.  Corbett,  M.V.  McConnell,  W.M.  Rabinowitz,  P.M  Pe'  ir.,-:  p  m.  Zurek,  "Multimicrophone  Monaural  Hearing 
Aids,"  RESNA  10th  Annual  Conference,  San  Jose,  California,  1987 

26  J.E.  Greenberg  and  P.M.  Zurek,  "Evaluation  of  an  Adaptive  Beamforming  Method  for  Hearing  Aids,"  J.  Acoust.  Soc.  Am.  91: 
1662-1676  (1992), 

22  H.  Cox,  R.M.  Zeskind,  and  T.  Kooij,  "Practical  Supergain,"  IEEE  Trans.  Acoust.  Speech  Sig.  Proc.  ASSP-34;  393-390  (1986). 

2B  R.w.  Stadler  and  W.M.  Rabinow’U,  "On  the  Potential  of  Fixed  Arrays  for  Hearing  Aids,"  submitted  to  J.  Acoosf.  Soc.  Am. 
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entation  of  the  array,  number  and  directionality  of 
the  microphones  within  the  array, “  complexity,  and 
robustness  of  the  required  processing.  For 
broadside  orientation,  a  variety  of  arrays  based  on 
cardioid  and  hypercardioid  microphones  gave  very 
similar  performance.  They  can  provide  broadband 
intelligibility-weighted  directivities  (direct-target  gain 
against  isotropic  noise)  of  7-8  dB  with  easily  imple¬ 
mented  weights  (simple  scalars).  For  endfire  orien¬ 
tation,  similar  directivities  result  with  weights  based 
on  analog  gains  and  pure  time  delays.  However, 
with  weightings  that  are  chosen  independently  for 
each  frequency,  directivities  up  to  ~  1 1  dB  may  be 
practical.  Because  of  sound  diffraction,  placement 
of  arrays  onto  the  head  potentially  impacts  both 
their  design  and  performance  In-situ  measure¬ 
ments  of  H  (f)  and  S22(f),  as  well  as  simplified  the¬ 
oretical  models,  are  suggested  to  explore  the 
optimization  of  head-mounted  arrays. 


1.3.2  Intelligibility-weighted  Measures  of 
Speech-to-Interference  Ratio 

Many  of  our  multimicrophone  subprojects  require 
assessment  of  the  improvement  in  intelligibility  pro¬ 
vided  by  processing  systems  that  are  essentially 
linear.  For  this  purpose,  we  have  developed  meas¬ 
ures  of  speech-to-interference  ratio  and  system 
gain  that  incorporate  factors  affecting  speech  intelli- 
gibility.3°  Such  measures  result  from  simple  modifi¬ 
cations  of  the  Articulation  and  Speech  Transmission 
Indices^^  and  are  applicable  to  the  same  Kinds  of 
signal  degradations  to  which  the  indices  apply. 
Under  restricted  conditions,  such  measures  are 
linear  transformations  of  either  the  Articulation 
Index  or  the  Speech  Transmission  Index  and  are  as 
accurate  as  these  indices  in  predicting  speech  intel¬ 
ligibility.  This  is  a  point  that  we  have  illustrated 
through  a  validation  study.  Depending  on  the  type 
of  signal  degradation,  measures  of  reduced  com¬ 
plexity  can  be  employed.  Such  measures  may 
prove  convenient  in  characterizing  listening  condi¬ 
tions  and  performance  of  speech-transmission 
systems. 


1.4  Cochlear  Prostheses 

Sponsor 

National  Institutes  of  Health 
Contract  2  P01  DC0036132 

Project  Staff 

Professor  Louis  D.  Braida,  Lorraine  A.  Delhorne, 
Dr.  Donald  K.  Eddington,  Dr.  William  M.  Rabinowitz 

The  overall  goals  of  this  research  are  to  determine 
and  understand  the  potential  and  limitations  of 
cochlear  prostheses  and  develop  improved 
prostheses.  Postlingually  deafened  adults  are 
implanted  with  the  Ineraid  multichannel  cochlear 
prosthesis  and  participate  in  intensive  multifactorial 
studies.  This  research  capitalizes  on  the  direct 
accessibility  of  the  implanted  electrode  array  via  a 
percutaneous  connector. 

During  the  past  year,  work  has  focused  on  (1)  anal¬ 
ysis  of  cue  integration  in  audiovisual  speech  recep¬ 
tion  and  (2)  alternative  speech  processing  for 
improved  implant  performance.  Progress  in  (1)  is 
described  below.  The  work  in  (2)  is  performed  with 
Joseph  Tierney,  and  Marc  Zissmann;  progress  is 
described  in  Part  V,  Section  3,  Chapter  1,  section 
1.7.2. 

The  work  on  audiovisual  integration  assesses  the 
ability  of  an  implantee  to  combine  cues  that  are 
available  from  separately  using  vision  and  audition. 
Because  most  implantees  require  audiovisual  input 
for  reliable  communication,  analysis  of  integration  is 
particularly  significant. 

A  series  of  experiments  on  closed-set  identification 
of  consonants  and  vowels  were  begun.  The  stimuli 
are  presented  via  computer-controlled  videodisc. 
Results  are  being  obtained  using  vision  alone, 
audio  alone,  and  audiovisually.  At  present,  most 
subjects  are  nearing  completion  of  the  testing; 
asymptotic  levels  of  performance  have  been 
reached  and  sufficient  data  have  been  obtained  to 
provide  reliable  estimates  of  confusion  patterns  for 
each  of  the  three  test  modes.  Analyses  to  deter- 


29  W.  Soede,  Improvement  of  Speech  Intelligibility  in  Noise:  Development  and  Evaluation  of  a  New  Hearing  Instrument  Based  on  Array 
Technology,  Ph.D.  diss.,  Delft  University  of  Technology,  The  Netherlands.  1990. 

30  j.E.  Greenberg,  P.M.  Peterson,  and  P.M.  Zurek,  "Inelligibility-Weighted  Measures  of  Speech-to-interference  Ratio  and  Speech 
System  Gain,”  submitted  to  J.  Acoust.  Soc.  Am. 

31  American  National  Standard  Institute,  'American  National  Standard  Methods  for  the  Calculation  of  the  Articulation  Index,”  (New  York,: 
ANSI,  1969);  H.J.M.  Steeneken  and  T.  Houtgast,  'A  Physical  Method  for  Measuring  Speech-transmission  Quality,”  J.  Acoust.  Soc. 
Am.  67:  318-326  (1980). 

32  Subcontract  from  Massachusetts  Eye  and  Ear  Infirmary.  Dr.  Joseph  B.  Nadol,  M.D.,  Principal  Investigator. 
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mine  the  quantitative  efficiency  of  audiovisual  inte¬ 
gration  are  underway.  These  analyses  exploit  an 
analytical  framework  (Braida,  1990)  that  quantifies 
how  performance  in  a  combined  mode  (e.g.,  audi¬ 
ovisual  stimulation)  is  influenced  by  integration  or 
interference  of  cues  available  in  the  isolated  com¬ 
ponent  modes  (e.g.,  audio  and  visual  stimulation 
alone). 


1.5  Binaural  Hearing 

Sponsor 

National  Institutes  of  Health 
Grant  5  R01  DC00100» 

Project  Staff 

Nathaniel  I.  Durlach,  Dr.  Patrick  M.  Zurek 

The  long-term  goal  of  this  program  is  (1)  to  develop 
an  integrated,  quantitative  theory  of  binaural  inter¬ 
action  that  is  consistent  with  psychophysical  and 
physiological  data  on  normal  and  impaired  auditory 
systems  and  (2)  to  apply  our  results  to  the  diag¬ 
nosis  and  treatment  of  hearing  impairments. 

Experimental  research  in  this  area  has  focused  on 
identifying  the  stimulus  variables  responsible  for 
in-head  lateralization  of  auditory  images.  It  is  well 
established  that  the  interaural  relations  that  exist 
early  after  the  onset  of  a  sound  have  a  strong  influ¬ 
ence  on  where  the  subsequent  sound  image  is 
heard.3^  It  is  believed  that  this  effect  allows  very 
good  sound  source  localization  in  highly  reverberant 
environments.  Our  recent  work  has  confirmed  that 
such  effects  measured  using  an  acoustic  pointer 
correspond  well  with  measurements  of  interaural 
parameter  resolution.^  Further  work  has  shown  that 
the  influence  of  the  onset  cue  can  extend  over  a 
relatively  long  (hundreds  of  milliseconds)  subse¬ 
quent  stimulus,  depending  on  both  the  ambiguity  of 


the  interaural  cues  in  the  subsequent  stimulus  and 
on  whether  it  appears  to  constitute  a  separate 
"auditory  object”.* 


1.6  Clinical  Applications  of  Binaural 
Hearing 

Sponsor 

National  Institutes  of  Health 
Contract  7  R29  DC004283" 

Project  Staff 

Dr.  Patrick  M.  Zurek 

In  this  project,  we  seek  to  apply  our  understanding 
of  binaural  advantages  in  speech  reception*  to  the 
prescription  of  hearing  aids,  both  monaural  and 
binaural,  and  to  test  this  understanding  further.  In 
particular,  the  experimental  work  is  focused  on  the 
performance  of  both  normal  hearing  and  hearing- 
impaired  listeners  on  tests  of  binaural  detection, 
localization,  and  contralateral  masking  with  various 
amplification  patterns  applied  at  both  ears. 


1.7  Tactile  Communication  of  Speech 

Sponsor 

National  Institutes  of  Health 
Grant  2  R01  DC00126 

Project  Staff 

Lorraine  A.  Delhorne,  Nathaniel  I.  Durlach,  Hong  Z. 
Tan,  Dr.  William  M.  Rabinowitz,  Dr.  Charlotte  M. 
Reed,  Dr.  Mandayam  A.  Srinivasan 

The  ultimate  goal  of  this  research  is  to  develop 
tactual  aids  for  the  deaf  and  deaf-blind  that  can 
serve  as  substitutes  for  hearing  in  speech  commu- 


33  Subcontract  from  Boston  University.  Professor  H.  Steven  Colburn,  Principal  Investigator. 

3*  P.M.  Zurek,  "The  Precedence  Effect,"  in  Directional  Hearing,  eds.  W.A.  Yost  and  G.  Gourevitch  (New  York;  Springer- Verlag,  1987); 
P.M.  Zurek,  "A  Note  on  Onset  Effects  in  Binaural  Hearing,"  J.  Acoust.  Soc.  Am.  93: 1200-1201  (1993). 

35  B.G.  Shinn-Cunningham,  P.M.  Zurek,  and  N.l.  Durlach,  "Adjustment  and  Discrimination  Measurements  of  the  Precedence  Effect"  J. 
Acoust.  Soc.  Am.,  forthcoming. 

36  R.L.  Freyman,  P.M.  Zurek,  U.  Balakrishnan.  and  C.  Yuan-Chuan,  "Onset  Dominance  in  Lateralization,"  submitted  to  J.  Acoust.  Soc. 
Am. 

37  Subcontract  from  University  of  Connecticut.  Dr.  Janet  D.  Koehnke,  Principal  Investigator. 

38  P.M.  Zurek,  "Binaural  Advantages  and  Directional  Effects  in  Speech  Intelligibility,"  in  Acoustical  Factors  Affecting  Hearing-aid  Perfor¬ 
mance,  eds.  G.A.  Studebaker  and  I.  Hochberg  (Boston:  Allyn  and  Bacon,  1993.) 
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nication.3s  The  objectives  and  specific  aims  of  our 
research  are  as  follows: 

1.  Basic  study  of  encoding  and  display  schemes 
to  develop  methods  of  displaying  acoustic 
signals  to  the  tactual  sense  for  optimal  informa¬ 
tion  transfer. 

2.  Research  on  tactual  systems  designed  specif¬ 
ically  to  aid  speechreading,  including  systems 
based  on  both  acoustic  and  articulatory-based 
signal  processing. 

3.  Evaluations  of  experienced  deaf  users  of  port¬ 
able,  wearable  tactual  aids  to  (a)  determine 
improvements  to  speech  reception  provided  by 
these  aids  and  (b)  compare  this  performance  to 
that  of  users  of  other  types  of  auditory 
prostheses. 


1.7.1  Basic  Study  of  Encoding  and  Dispiay 
Schemes 

A  study  of  joint-angle  discrimination  (JND)  and 
identification  of  the  proximal  interpalangeal  (PIP) 
joint  of  the  index  finger  has  been  completed. 
During  the  current  year,  additional  data  have  been 
collected  on  the  discriminability  of  the  PIP  joint 
angle  as  a  function  of  metacarpal  phalangeal 
(MCP)  joint  position.  The  data  indicate  that  JNDs 
for  PIP  joint  position  are  independent  of  the  MCP 
joint  position  and  average  2.6  degrees.  Additional 
data  on  PIP  joint-angle  discrimination  as  a  function 
of  the  PIP  joint  reference  position  indicate  subject- 
dependent  results:  JNDs  for  two  subjects  are 
roughly  independent  of  the  reference  position  (aver¬ 
aging  2.4-2.5  degrees),  while  JNDs  for  a  third 
subject  increased  with  greater  flexion  of  the  PIP 
joint. 

Preliminary  work  has  begun  on  a  study  concerned 
with  defining  the  range  of  amplitudes  and  frequen¬ 
cies  that  can  be  used  to  stimulate  the  tactual 
system,  as  well  as  specifying  independent  percep¬ 
tual  channels  and  resolution  within  these  channels. 


This  work  includes  development  of  a  device  for 
stimulus  presentation  consisting  of  a  plastic  bar 
attached  to  a  rotary  motor  with  2000  discrete, 
specifiable  steps.  Pilot  studies  indicate  that  this 
device  is  capable  of  presenting  signals  in  the  range 
of  1  to  100  Hz  over  a  fairly  broad  dynamic  range. 
Existing  software  has  been  modified  to  collect  data 
on  amplitude  and  frequency  resolution  in  a  one- 
interval  two-alternative  forced-choice  procedure. 

Work  has  also  begun  on  a  study  of  the  manual  dis¬ 
crimination  of  the  property  of  thickness.  A  series  of 
plates  varying  in  thickness  has  been  prepared  for 
use  as  the  stimuli  in  this  study,  and  a  device  has 
been  built  for  presenting  these  plates  to  subjects  in 
experiments  concerned  with  the  ability  to  discrimi¬ 
nate  thickness  as  a  function  of  the  reference  value. 
Finally,  a  manuscript  summarizing  research  on  the 
manual  discrimination  of  length,  force,  and  compli¬ 
ance  has  been  prepared.'*'* 


1.7.2  Tactual  Supplements  to 
Speechreading 

In  the  area  of  acoustic  supplements  to  speech¬ 
reading,  work  during  the  current  year  has  been  con¬ 
cerned  with  developing  a  more  complete 
understanding  of  the  differential  results  obtained  for 
auditory  and  tactile  presentation  of  acoustic-based 
low-bandwidth  supplements  to  speechreading.*’ 
This  work  has  included  signal  processing  and  soft¬ 
ware  development  for  a  set  of  psychophysical 
measurements  of  modulation  resolution.  In  the 
area  of  articulatory  supplements  to  speechreading, 
work  has  been  concerned  with  designing  an 
encoding  scheme  for  presenting  various  types  of 
articulatory  information  through  a  seven-channel 
vibratory  array.  The  vibrators  can  be  controlled 
through  sine-wave  inputs  and  various  vibratory  pat¬ 
terns  can  be  used  to  encode  articulatory  features  of 
consonants  and  vowels.  Preliminary  psychophys¬ 
ical  studies  are  being  conducted  to  determine  a  set 
of  vibratory  patterns  that  provide  sufficient  informa¬ 
tion  for  this  problem. 


39  c.M.  Reed,  N.l.  Durlach,  and  L.A.  Delhorne,  "Historical  Overview  of  Tactile  Aid  Research,"  Proceedings  of  the  Second  International 
Conference  on  Tactile  Aids,  Hearing  Aids,  and  Cochlear  Implant,  eds.  A.  Risberg  and  K.-E.  Spens  (Stockholm,  Sweden:  Royal 
Institute  of  Technology,  forthcoming). 

«  H.Z.  Tan,  X.D.  Pang,  and  N.l.  Durlach,  "Manual  Resolution  of  Length,  Force,  and  Compliance,"  Winter  Annual  Meeting  of  the  Amer¬ 
ican  Society  of  Mechanical  Engineers,  Anaheim,  California,  November  8-13,  1992. 

M.J.  Besing,  C  M.  Reed,  and  K.W.  Grant,  "Tactual  Presentation  of  Low-Bandwidth  Envelope  Signals:  Benefits  to  Speechreading," 
ASHA  33:  212,  1991. 
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1 .7.3  Evaluation  of  Practical  Aids 

Three  new  subjects  have  joined  our  field  study  of 
tactile-aid  users,  bringing  the  total  number  of  sub¬ 
jects  to  eight.  During  the  current  year,  six  of  the 
eight  subjects  visited  our  laboratory  for  evaluations 
of  their  speechreading  performance  with  the  Tactaid 
Vll.  Data  are  now  available  from  three  separate 
testing  sessions  with  one  subject  (JL),  two  testing 
sessions  with  two  subjects  (RM,  RS),  and  one 
session  with  each  of  the  remaining  subjects.  The 
evaluations  have  been  concerned  primarily  with 
assessing  speechreading  ability  with  and  without 
tactile  devices,  as  well  as  with  measuring 
discriminability  of  simple  speech  segments  through 
the  use  of  tactile  devices  by  themselves.  Thus  far, 
tactile  devices  employed  in  the  study  include 
Tactaid  Vll  (which  ail  subjects  in  the  field  evaluation 
have  received),  Tactaid  II  (tested  only  on  three  of 
the  subjects  who  had  used  the  Tactaid  II  prior  to 
Tactaid  Vll),  and  a  high-performance  single-channel 
vibrator  (Alpha-M  AV-6  Minishaker)  employed  only 
in  laboratory  testing. 

Results  of  segmental  discrimination  tests  conducted 
with  Tactaid  Vll  indicate  that,  averaged  across  sub¬ 
jects  and  across  consonant  and  vowel  pairs,  perfor¬ 
mance  is  roughly  70  percent  correct.''^  Additional 
data  collected  on  one  subject  with  Tactaid  If 
showed  that  performance  for  both  devices  was 
approximately  similar.  Results  of  tests  conducted 
for  speechreading  alone  and  speechreading  in  com¬ 
bination  with  a  tactile  device  (using  sentence  and 
continuous-discourse  materials)  showed  a  range  of 
performance  across  subjects.  For  example,  for  the 
reception  of  words  in  CUNY  sentences,  scores  from 
all  subjects  ranged  from  32  to  86  percent  correct 
(averaging  53  percent)  for  speechreading  alone  and 
29  to  93  percent  (averaging  58  percent)  for  speech¬ 
reading  plus  Tactaid  Vll.  Of  the  six  subjects  for 
whom  data  are  currently  available  on  this  task, 
improvements  to  speechreading  with  the  use  of 
Tactaid  Vll  were  observed  for  four  subjects  (ranging 


from  5  to  17  percentage  points  improvement),  while 
two  subjects  have  shown  no  improvements  to 
speechreading  of  CUNY  sentences  with  Tactaid  Vll 
thus  far.  Data  from  three  subjects  tested  with 
Tactaid  II  (as  well  as  with  Tactaid  Vll)  on  this  task 
indicate  greater  improvements  to  speechreading  (by 
approximately  8  percentage  points)  when  using 
Tactaid  II  compared  with  Tactaid  Vll.  Some  prelimi¬ 
nary  data  obtained  on  three  subjects  for  speech¬ 
reading  of  CUNY  sentences  with  the  Minishaker 
indicate  that  the  benefits  provided  by  this  device 
are  comparable  to  those  observed  with  Tactaid  II. 
The  results  of  a  questionnaire  assessing  subjects' 
use  of  tactile  devices  indicate  that  most  prefer 
Tactaid  Vll  to  Tactaid  II.  This  is  primarily  because 
of  the  greater  utility  of  Tactaid  Vll  in  detecting  and 
recognizing  environmental  sounds  in  addition  to  the 
benefits  it  provides  for  speechreading.*^ 


1.7.4  Continuation  and  Compietion  of 
Previous  Work  on  Natural  Methods  of 
Tactual  Communication 

During  the  past  year,  we  have  published  summaries 
of  our  work  on  the  communication  abilities  of  deaf- 
blind  subjects  who  use  various  natural  methods  of 
tactual  communication,  including  the  Tadoma 
method  of  speech  reception  and  tactual  reception 
of  fingerspelling  and  sign  language.**  A  manuscript 
describing  a  series  of  experiments  conducted  to 
demonstrate  improvements  to  speech  reception 
through  Tadoma  using  supplementary  tactual  infor¬ 
mation  has  also  been  published.*® 

1.8  Super-Auditory  Localization  for 
Improved  Human-Machine  Interfaces 

Sponsor 

U.S.  Air  Force  -  Office  of  Scientific  Research 
Grant  AFOSR  90-0200 


<2  C.M.  Reed,  L.A.  Delhorne,  and  N.l.  Durlach,  'Results  Obtained  with  Tactaid  II  and  Tactaid  Vll,"  Proceedings  of  the  Second  Interna¬ 
tional  Conference  on  Tactile  Aids,  Hearing  Aids,  and  Cochlear  Implants,  eds.  A,  Risberg  and  K.-E.  Spans  (Stockholm,  Sweden: 
Royal  Institute  of  Technology,  forthcoming. 

<3  C.M.  Reed  and  L.A.  Delhorne,  "Field  Study  of  Deaf  Adult  Users  of  Tactaid  II  and  Tactaid  Vll,"  Presentation  made  at  the  Annual 
Convention  of  the  Association  for  Late  Deafened  Adults,  Boston,  Massachusetts,  September  10-13,  1992. 

**  C.M.  Reed,  N.l.  Durlach,  and  L.A.  Delhorne,  "Natural  Methods  of  Tactual  Communication,"  chapter  in  Tactile  Aids  for  the  Hearing 
Impaired,  ed.  Ian  P.  Summers,  (Whurr  Publishers  Limited,  1992),  pages  218-230;  C.M.  Reed,  N.l.  Durlach,  and  L.A.  Delhorne,  "The 
Tactual  Reception  of  Speech,  Fingerspelling,  and  Sign  Language  by  the  Deaf-Blind,"  SID  Digest  102-105  (1992). 

«  C.M.  Reed,  W.M.  Rabinowitz,  N.l.  Durlach,  L.A,  Delhorne,  L.D.  Braida,  J.C.  Pemberton,  B.D.  Mulcahey,  and  D.L.  Washington,  "Ana¬ 
lytic  Study  of  the  Tadoma  Method:  Improving  Performance  through  the  Use  of  Supplementary  Tactual  Displays,"  J.  Speech  Hear. 
Res.  35:  450-465  (1992). 
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Figure  1. 


Project  Staff 

Nathaniel  I.  Durlach,  Eric  M.  Fuchs,  Dr.  Richard  M. 
Held,  Dr.  William  M.  Rabinowitz,  Yun  Shao, 
Barbara  G.  Shinn-C’jnningham,  Min  Wei 

General  background  on  this  project  was  presented 
in  pages  312-313  of  the  annual  RLE  Progress 
Report  Number  134.  During  the  past  year,  work 
has  advanced  along  the  following  fronts. 

1.  Further  localization  identification  experiments 
have  been  conducted  using  a  transformation  of 
the  function  relating  head-related  transfer  func¬ 
tion  to  azimuth  such  that  azimuthal  space  is 
magnified  in  front  and  minified  off  to  the  side 
(see  figure  1).  As  expected,  results  show 
increased  resolution  in  the  center  and 
decreased  resolution  to  the  side.  In  addition,  in 
contrast  to  our  initial  impression  that  no 
sensorimotor  adaptation  took  place,  i.e.,  that 
response  bias  failed  to  decay  with  an  increase 
in  exposure  time,  our  results  now  clearly  show 
consistent  and  substantial  adaptation  (in  terms 
of  both  the  direct  effect  and  the  negative  after 
effect).  However,  results  also  indicate  that  the 
decrease  in  response  bias  over  time  is  accom¬ 
panied  by  a  decrease  in  resolution  over  time. 
We  are  now  studying  these  data  as  well  as 
performing  further  experiments  to  determine 
the  underlying  causes  of  this  phenomenon. 
Illustrative  data  showing  the  changes  in  both 
resolution  and  bias  are  presented  in  figure  2. 

2.  Further  work  on  facilities  development  has 
included  four  projects.  First,  we  have  acquired, 
integrated,  and  tested  a  new  analog  hardware 
processor  to  cross-check  the  results  already 
obtained  with  the  convolvotron.  Second,  we 
have  constructed  a  pseudophone  (a  head-worn 
microphone/earphone/amplification  system  with 


controllable  geometry  and  acoustic  character¬ 
istics)  to  provide  us  with  a  system  in  which 
localization  cues  are  transformed  acoustically 
rather  than  by  signal  processing  and  in  which 
no  head-tracking  is  required.  Third,  we  have 
begun  to  develop  a  new  head  tracker  based  on 
inertial  sensors  for  use  with  acoustical  simu¬ 
lation  systems.  Fourth,  and  finally,  we  have 
developed  an  additional  experimental  set-up 
which  allows  us  to  simulate  a  sound  source 
held  in  the  hand  so  that  we  can  make  direct 
comparisons  with  classical  studies  on  adaption 
in  the  visual  sense  using  optical  prisms  and  a 
manual  pointing  response. 

3.  Further  work  on  the  dissemination  of  our 
results  has  included  talks  at  the  Society  for 
Information  Displays,  the  First  International 
Conference  on  Auditory  Displays,  and  the 
Acoustical  Society  of  America,  as  well  as  the 
publication  of  an  article  in  Preserice  (Durlach  et 
al.,  1992). 


1.9  Research  on  Reduced-Capability 
Human  Hands 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  NOOOU-90-J-1935 

Project  Staff 

Lorraine  A.  Delhorne,  Nathaniel  I.  Durlach,  Dr. 
Mandayam  A.  Srinivasan 

General  background  on  this  project  was  presented 
in  RLE  Progress  Report  Number  134  (pages 
313-314).  During  the  past  year,  attention  has  been 
focused  on  further  data  collection  and  data  analysis 
for  constrained-hand  performance  in  a  subset  of  the 
tasks  previously  used  by  the  Navy  in  the  TOPS 
teleoperator  study  (Smith  and  Shimamoto,  1991). 

Although  direct  comparisons  between  the  results  of 
our  tests  and  those  contained  in  the  TOPS  project 
are  difficult  to  interpret  because  of  the  many  differ¬ 
ences  in  the  two  testing  situations  (e.g.,  direct 
vision  versus  vision  through  a  helmet  mounted 
display,  working  in  air  versus  working  under  water, 
etc.),  our  results  suggest  a  number  of  important 
conclusions. 

First,  the  results  indicate  that  all  tasks  considered 
were  essentially  two-finger  tasks.  We  say  this 
because  (1)  the  results  obtained  with  two  fingers 
were  nearly  as  good  as  those  obtained  with  more 
fingers  (both  for  the  real  hands  and  for  the  teleoper¬ 
ator  hands)  and  (2)  we  know  from  casual  obsen/a- 
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Azimuth 


Figure  2. 

tion  that  there  exist  many  tasks  for  which  two 
fingers  are  grossly  inadequate. 

Second,  the  results  indicate,  quite  surprisingly,  that 
performing  the  task  with  a  normal  human  hand 
using  a  pair  of  pliers  is  (1)  much  superior  to  per¬ 
forming  the  task  with  either  of  the  teleoperator 
systems  and  (2)  nearly  as  good  as  performing  the 
task  with  the  system  consisting  of  the  thumb  and 
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index  finger  of  the  normal  human  hand.  Result  (1) 
suggests  that  the  two  teleoperator  systems  had  a 
great  deal  of  mechanical  "slop,"  i.e.,  that  informa¬ 
tion  on  position  and  force  (in  one  or  both  directions) 
exhibited  only  a  low  level  of  fidelity.  Item  (2)  sug¬ 
gests  that  for  the  given  tasks  both  the  compliant 
nature  of  the  human  finger  pads  and  the  tactile 
sensing  in  the  human  finger  pads  were  relatively 
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unimportant.  (This  result  has  been  confirmed  by 
recent  experiments  with  anesthetized  fingers.)  In 
our  opinion,  this  feature  of  the  results  would  have 
been  radically  different  if  the  operator  were 
deprived  of  visual  information.  Overall,  we  believe 
our  results  are  consistent  with  the  idea  that  visual 
information  is  more  redundant  with  tactile  informa¬ 
tion  than  with  force  information.  Stated  differently, 
for  non-compliant  objects,  vision  can  tell  you  a  lot 
about  shape  and  texture,  but  not  about  applied 
force. 

As  discussed  in  our  previous  report  (RLE  Progress 
Report  Number  134),  we  believe  that  the  problem 
of  defining  an  adequate  set  of  basis  tasks  is  the 
most  crucial  research  problem  now  facing  the  study 
of  hand  function  and  its  applications  to  the  design 
and  evaluation  of  artificial  hands. 

Current  work  on  this  project  is  being  devoted  to 
preparation  of  a  final  report  for  this  grant.  It  is 
anticipated  that  further  research  in  this  area  will  be 
carried  out  under  our  other  grants  concerned  with 
manual  sensing  and  manipulation. 

1.10  Mechanistic  Modeling  of  Primate 
Fingerpad 

Sponsor 

National  Institutes  of  Health 
Grant  5  R29  DC00625 

Project  Staff 

Kiran  B.  Dandekar,  Dr.  Mandayam  A.  Srinivasan, 
Rogeve  J.  Gulati 

Whenever  we  touch  an  object,  the  source  of  all 
tactile  information  is  the  spatio-temporal  distribution 
of  mechanical  loads  on  the  skin  at  the  contact  inter¬ 
face.  The  relationship  between  these  loads  and  the 
resulting  stresses  and  strains  at  the  nerve  terminals 
within  the  skin  plays  a  fundamental  role  in  the 
neural  coding  of  tactile  information.  Although 
empirical  determination  of  the  stress  or  strain  state 
of  a  mechanoreceptor  is  not  possible  at  present, 
mechanistic  models  of  the  skin  and  subcutaneous 
tissues  enable  us  to  generate  testable  hypotheses 
on  skin  deformations  and  associated  peripheral 
neural  responses.  Then  verification  of  the  hypoth¬ 
eses  can  be  accomplished  by  comparing  the  calcu¬ 


lated  results  from  the  models  with  biomechanical 
data  on  the  deformation  of  skin  and  subcutaneous 
tissues,  and  neurophysiological  data  from 
recordings  of  the  responses  of  single  neural  fibers. 
The  research  under  this  grant  is  directed  towards 
applying  analytical  and  computational  mechanics  to 
analyze  the  biomechanical  aspects  of  touch — ^the 
mechanics  of  contact,  transmission  of  mechanical 
signals  through  skin,  and  their  transduction  into 
neural  impulses  by  the  mechanoreceptors. 


1.10.1  Determination  of  Geometric  and 
Materiai  Properties  of  the  Primate  Fingertip 
(distal  phalanx) 

The  first  step  in  performing  mechanistic  analyses  of 
the  primate  fingertip  is  to  determine  its  geometric 
and  material  properties.  The  three-dimensional 
{3D)  external  geometry  of  the  primate  fingertips  was 
determined  from  accurate  epoxy  replicas  of  human 
and  monkey  fingertips.  Using  a  videomicroscopy 
setup,  we  obtained  images  of  orthographic 
projections  of  the  epoxy  replicas  at  various  known 
orientations.  The  images  were  then  digitized  and 
processed  to  determine  the  boundary  of  the  finger 
at  each  orientation.  By  combining  the  boundary 
data  for  all  the  different  orientations,  we  were  able 
to  reconstruct  the  3D  external  geometry  of  the 
fingertip.'^  We  have  reconstructed  several  human 
and  monkey  fingertips  by  this  method. 

For  mechanical  modeling  of  the  human  fingerpad, 
the  Poisson's  ratio,  which  is  a  measure  of  its 
compressibility,  is  required  as  an  input  to  math¬ 
ematical  models.  The  Poisson’s  ratio  for  the 
human  fingerpad  in  vivo  is  unknown  at  present. 
Available  in  vitro  data  for  skin  cannot  be  used  as 
the  mechanical  properties  are  known  to  be  different 
for  in  vivo  and  in  vitro  conditions.  Also,  the 
fingerpad  as  a  heterogeneous  structure  behaves 
differently  from  the  skin  by  itself.  We  have  estab¬ 
lished  an  experimental  setup  where  non-invasive 
measurements  of  the  in  vivo  compressibility  of  the 
fingertip  can  be  done  on  human  subjects.^^  We 
have  indented  the  fingerpads  of  four  subjects  with 
three  different  indentors  and  measured  the  change 
in  volume  of  the  fingerpad.  Our  results  show  that 
the  compressibility  of  the  fingertip  increases  with 
increases  in  both  depth  of  indentation  and  contact 
area  with  the  indentor.  The  highest  change  in 


*6  T.R.R.  Perez  and  M.A.  Srinivasan,  "Videomicroscopic  Reconstruction  of  the  Human  Finger,"  Project  report  to  the  MIT  Minority 
Summer  Science  Research  Program,  1 992. 

<7  M.A.  Srinivasan,  R.J.  Gulati,  and  K.  Dandekar,  ’In  Vivo  Compressibility  of  the  Human  Fingertip,”  Proceedings  of  the  Bioengineering 
Division,  American  Society  of  Mechanical  Engineers  Winter  Annual  Meeting,  Anaheim,  California,  November  8-13,  1992. 
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fingertip  volume  was  about  5  percent,  indicating 
that  the  Poisson's  ratio  of  the  fingerpad  is  close  to, 
but  not  equal  to  0.5/® 


1.10.2  Fingertip  Modeis  and  Finite  Element 
Analyses 

In  past  experiments,  we  have  indented  the 
fingerpads  of  humans  and  monkeys  in  vivo  using  a 
line  load  delivered  by  a  sharp  wedge  and  then  pho¬ 
tographed  the  resulting  skin  surface  deflections. 
We  have  shown  that  the  homogeneous  elastic 
model  of  the  fingertip  only  roughly  approximates  the 
experimental  data,  while  a  simple  alternative  model, 
which  views  the  fingertip  as  an  elastic  membrane 
filled  with  an  incompressible  fluid  (like  a  "waterbed”) 
predicted  the  observed  profiles  very  well.^® 

More  recently,  we  have  performed  lint  nnd  non¬ 
linear  finite  element  analyses  of  a  eries  of 
mechanistic  models  of  the  fingerpad  under  a  variety 
of  mechanical  stimuli.®®  The  models  range  from  a 
semi-infinite  medium  to  a  three-dimensional  model 
based  on  the  actual  finger  geometry,  composed  of 
either  a  homogeneous  elastic  material  or  a  thick 
elastic  shell  containing  a  fluid.  Simulations  of  the 
mechanistic  aspects  of  neurophysiological  exper¬ 
iments  involving  mapping  of  receptive  fields  with 
single  point  loads,  determination  of  spatial  resol¬ 
ution  of  two-point  stimuli,  and  indentations  by  single 
bars  as  well  as  periodic  and  aperiodic  gratings  have 
been  carried  out  for  2D  homogeneous  models.  We 
have  also  solved  the  nonlinear  contact  problem  of 
indentations  by  cylindrical  objects.  The  large 
number  of  numerical  calculations  needed  even  for 
the  linear  two-dimensional  models  necessitated  the 
use  of  the  MIT  supercomputer.  The  results  show, 
for  example,  that  the  strain  energy  density  at  the 


receptor  site  is  probably  the  relevant  stimulus  that 
causes  the  responses  recorded  from  slowly 
adapting  afferent  fibers. 

After  analyzing  the  2D  models,  we  found  that  a  3D 
model  based  on  the  actual  geometry  of  the 
fingerpad  was  necessary  to  generate  testable 
hypotheses  about  the  transduction  of  mechanical 
stimuli  in  primate  tactile  sense.  Using  the  fingertip 
geometry  data  obtained  by  employing  the 
videomicroscopy  system,  we  constructed  a  realistic 
computer  model  of  the  primate  fingerpad.  We  then 
generated  a  finite  element  mesh  in  the  model  using 
the  solid  modeler  software  PATRAN.  After  having 
completed  some  preliminary  checks,  we  are  cur¬ 
rently  simulating  neurophysiological  experiments 
using  the  3D  model. 


1.10.3  Tactile  Sensing  of  Microtexture, 
Shape,  and  Softness 

We  have  been  collaborating  with  Professor  R.H. 
LaMotte  of  Yale  University  School  of  Medicine  in 
conducting  psychophysical  and  neurophysiological 
studies  on  how  humans  sense  by  touch  alone  the 
microtextures,  shapes  and  softness  of  objects.®’  By 
using  methods  of  photolithography,  we  etched  sub¬ 
micron-high  bar  gratings  on  glass  plates.  We  have 
shown  that  humans  can  detect  extremly  fine  tex¬ 
tures  composed  of  parallel  bars  only  50  nanometers 
high.®2  Our  earlier  neurophysiological  recordings 
with  bigger  texture  elements  indicate  that  when 
such  fine  textures  are  stroked,  skin  vibrates  and 
causes  Pacinian  Corpuscles  to  respond,  thus  ena¬ 
bling  detection  of  the  microtexure.®® 

Based  on  a  theoretical  analysis  of  the  mechanics  of 
contact,  we  have  demonstrated  that  the  receptors 
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submitted  to  J.  Biomech.  (1992). 
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respond  to  the  low-pass  filtered  versions  of  surface 
pressures.®^  Thus,  curvature  of  the  skin  surface 
under  an  object,  which  we  know  from  differential 
geometry  is  approximated  by  the  second  spatial 
derivative  of  surface  deflection,  is  coded  by 
exploiting  its  relation  to  surface  pressure  rather  than 
differentiating  (which  is  a  noise  enhancing  process). 
For  the  next  set  of  experiments,  we  have  designed 
and  fabricated  a  high  performance  tactile  stimulator 
controlled  by  a  PC  through  transputers.  Our 
ongoing  experiments  on  shape  sensing  involve  use 
of  precisely  manufactured  cylindrical,  ellipsoidal, 
and  spherical  surfaces  as  stimuli.® 

We  have  also  shown  that  the  human  discriminability 
of  compliance  of  objects  depends  on  whether  the 
object  has  a  deformable  or  rigid  surface.  When  the 
surface  is  deformable,  the  spatial  pressure  distribu¬ 
tion  within  the  contact  region  is  dependent  on 
object  compliance,  and  hence  information  from 
cutaneous  mechanoreceptors  is  sufficient  for  dis¬ 
crimination  of  subtle  differences  in  compliance. 
When  the  surface  is  rigid,  kinesthetic  information  is 
necessary  for  discrimination,  and  the  discriminability 
is  much  poorer  than  that  for  objects  with 
deformable  surfaces. 


1.10.4  Development  of  a  Computational 
Theory  of  Touch 

Although  the  "hardware"  of  the  tactile  apparatus  in 
humans  and  robots  is  different,  they  have  the 
common  feature  of  mechanosensors  embedded  in 
a  deformable  medium.  Thus,  the  computational 
problem  of  coding  (predicting  sensor  response  for  a 
given  mechanical  stimulus  at  the  surface)  and 
decoding  (inferring  the  mechanical  stimulus  at  the 
surface  by  suitably  processing  the  sensor  response) 
need  similar  mechanistic  analyses  for  their  solution. 
We  have  developed  a  computational  theory  for  an 
idealized  medium  subjected  to  arbitrary  pressure  or 
displacement  loading  conditions  and  have  provided 


explicit  formulae  for  the  coding  and  decoding  prob¬ 
lems.® 

In  collaboration  with  Dr.  Annaswamy  of  the  Depart¬ 
ment  of  Mechanical  Engineering  at  MIT,  we  have 
investigated  some  of  the  identification  and  control 
problems  that  occur  in  the  context  of  manipulation 
when  compliance  is  present  in  the  end-effectors  as 
well  as  in  the  object.^^  In  order  to  understand  the 
fundamental  aspects  of  these  tasks,  we  have  ana¬ 
lyzed  the  problem  of  identification  of  compliant 
objects  with  a  single  finger  contact,  as  well  as 
under  a  two-finger  grasp.  Assuming  that  the  finger 
and  the  compliant  object  are  constrained  to  deform 
along  a  single  spatial  dimension,  we  have  carried 
out  parameter  identification  using  either  force  or 
displacement  inputs  to  the  rigid  backing  of  the  end- 
effector.  Based  on  this  analysis,  control  strategies 
are  developed  to  achieve  a  desired  manipulation  of 
the  object  in  the  workspace.  Animated  graphical 
renderings  have  been  developed  to  visually  illus¬ 
trate  the  presence  or  absence  of  slipping  and 
crushing  during  an  active  manipulation  task.  The 
theoretical  results  can  be  used  to  generate  testable 
hypotheses  for  human  or  robot  experiments  on 
tactual  sense. 


1.11  Biomechanics  of  Skin-Object 
Contact 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N0001 4-91 -J-1 454 

Project  Staff 

Dr.  Mandayam  A.  Srinivasan,  Jyh-Shing  Chen, 
Rogeve  J.  Gulati,  Louise  Jandura 

Although  physical  contact  is  ubiquitous  in  our  inter¬ 
actions  with  objects  in  the  environment,  we  do  not 
yet  understand  the  phenomena  occurring  at  the 
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skin-object  interface.  As  mentioned  before,  the 
spatio-temporal  distribution  of  mechanical  loads  on 
skin  at  the  contact  interface  is  the  source  of  all 
tactile  information.  These  loads,  specified  as  pres¬ 
sure,  displacements,  etc.,  depend  on  the  geomet¬ 
rical  and  material  properties  of  both  the  contacting 
entities  as  well  as  the  overall  forces  of  interaction. 
The  goal  of  this  project  is  to  determine  the  growth 
and  motion  of  contact  regions  and  associated  force 
variations  over  time  between  the  human  fingerpad 
and  carefully  chosen  transparent  test  objects  whose 
microtexture,  shape  or  softness  is  varied  in  a  con¬ 
trolled  manner.  These  results  are  being  used  to 
gain  a  deeper  understanding  of  the  data  we  have 
already  obtained  for  the  same  test  objects,  such  as 
electrophysiologically  recorded  responses  of  cuta¬ 
neous  mechanoreceptive  afferent  fibers,  and 
psychophysically  determined  human 

discriminabilities. 

Our  initial  goal  was  to  record  the  forces  and  video- 
images  of  contact  under  active  touch  when  subjects 
press  or  stroke  various  transparent  specimens 
mounted  on  a  force  transducer.  Accordingly,  we 
designed  a  videomicroscopy  system  and  a  force 
sensing  system,  both  interfaced  to  a  computer. 
The  videomicroscopy  system  consists  of  a  set  of 
video  zoom  lenses  attached  to  a  high-resolution 
CCD  camera  whose  output  can  either  be  digitized 
directly  at  about  5  frames/s  or  stored  on  a  laserdisk 
at  real-time  frame  rates  (30  frames/s)  for  off-line 
digitization.  The  zoom  lenses  enable  continuous 
variation  of  magnification  with  the  field  of  view  cov¬ 
ering  the  entire  fingerpad  or  only  a  few  fingerprint 
ridges.  High  contrast  images  are  achieved  with 
coaxial  lighting.  In  collaboration  with  our  colleagues 
at  the  Artificial  Intelligence  Laboratory  at  MIT,  we 
designed  and  built  two  6-axis  force  sensors  that  are 
customized  to  our  application.  These  sensors  have 
much  higher  resolutions  (10  bit)  than  commercial 
sensors  operating  in  comparable  range  of  forces  (5 
Newtons).  The  transparent  test  objects  can  be 
attached  to  these  sensors  for  both  biomechanical 
and  psychophysical  experiments. 

Using  the  test  facility  described  above,  we  have 
performed  a  set  of  experiments  with  human  sub¬ 
jects  to  investigate  the  relationship  between  the 
contact  force,  contact  area,  and  compliance  of  the 
object.  The  experiments  involved  active  indentation 
of  transparent  compliant  rubber  specimens  and  a 
glass  plate  with  the  subjects'  fingerpads.  Static 
video  images  of  the  contact  regions  were  captured 
at  various  force  levels  and  magnifications.  In  order 
to  minimize  the  effects  of  non-uniform  illumination, 
we  implemented  homomorphic  image  processing 
algorithms.  This  essentially  consisted  of  high-pass 
filtering  to  remove  the  low  frequency  (spatial)  illumi¬ 
nation  intensity  variations  so  that  only  the  higher 


frequency  variations  due  to  the  fingerprint  patterns 
remained  in  the  image.  For  high  magnification 
images,  since  the  spatial  frequency  of  non-uniform 
lighting  was  of  the  same  order  as  the  finger  ridge 
frequency,  image  decimation  techniques  had  to  be 
employed  in  addition  to  homomorphic  processing. 
The  processed  images  showed  that  contact  regions 
consisted  of  discontinuous  "islands"  along  each 
finger  ridge  with  clear  distinction  between  contact 
and  non-contact  regions  over  the  entire  field  of 
view. 

As  expected,  the  results  show  that  the  nominal 
overall  contact  area  increases  as  the  contact  force 
increases  for  all  the  specimens.  More  interestingly, 
the  percentage  of  actual  contact  area  relative  to  the 
nominal  (overall)  contact  area  also  increases  and  is 
a  function  of  the  object  compliance.  For  the  plane 
glass  plate,  the  percentage  contact  area  is  only 
around  18  percent  for  0.1  N  force  and  about  22 
percent  for  1 .5  N  force.  In  contrast,  the  percentage 
contact  area  for  a  compliant  rubber  specimen  is 
much  higher  than  that  for  the  plane  glass  plate, 
about  43  percent  for  0.1  N  force  and  more  than  80 
percent  for  1.5  N  force.  The  implication  to  tactual 
discrimination  of  softness  is  that  for  objects  whose 
compliances  can  be  discriminated  even  when  the 
overall  contact  areas  under  a  given  contact  force 
are  the  same,  the  actual  contact  areas  can  differ  by 
a  factor  of  two  or  more.  The  actual  pressure  dis¬ 
tribution,  which  acts  only  within  the  discontinuous 
contact  islands  on  the  skin,  will  therefore  be  radi¬ 
cally  different  for  the  different  objects.  Conse¬ 
quently,  a  spatio-temporal  neural  code  for  object 
compliance  emerges  with  far  higher  resolution  than 
an  intensive  code  such  as  the  average  pressure 
over  the  overall  contact  area.  These  results  are  in 
agreement  with  our  hypothesis  that  the  neural 
coding  of  objects  with  deformable  surfaces  (such  as 
rubber)  is  based  on  the  spatio-temporal  pressure 
distribution  on  the  skin.  This  was  one  of  the  con¬ 
clusions  from  our  psychophysical,  biomechanical, 
and  neurophysiological  experiments  in  a  companion 
project  conducted  in  collaboration  with  Professor 
R.H.  LaMotte  of  the  Yale  University  School  of  Medi¬ 
cine. 

In  addition  to  helping  us  develop  an  understanding 
of  the  processes  involved  in  the  peripheral  neural 
encoding  of  object  compliance  information,  the 
results  of  these  experiments  provide  testable 
hypotheses  for  investigating  cortical  representation 
of  this  information  and  the  processing  algorithms 
used  by  the  central  nervous  system  for  decoding 
the  peripheral  neural  signals.  These  results  are 
also  useful  in  the  design  of  tactile  displays  that 
convey  object  compliance  information  to  human 
operators  in  teleoperation  and  virtual  environment 
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systems.®®  The  results  of  the  experiments  also  help 
in  the  design  of  tactile  sensors  for  robots  and 
development  of  algorithms  to  decode  the  sensor 
information  on  object  compliance. 

1.12  Human  and  Robot  Hands: 
Mechanics,  Sensorimotor  Functions  and 
Cognition 

Sponsor 

U.S.  Navy  -  Office  of  Naval  Research 
Grant  N00014-92-J-1814 

Project  Staff 

Dr.  Mandayam  A.  Srinivasan,  Dr.  J.  Kenneth 
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Kiran  B.  Dandekar 

This  is  a  new  project  funded  by  the  University 
Research  Initiative  Program  of  the  Office  of  Naval 
Research  (ONR)  which  involves  several  investi¬ 
gators  from  MIT,  Harvard  University,  and  Yale  Uni¬ 
versity.  The  premise  is  that  humans  and  robots 
can  be  viewed  as  two  implementations  of  systems 
that  perform  tactual  sensorimotor  tasks  under  a 
common  set  of  constraints.  By  devising  a  research 
plan  for  the  study  of  human  and  robot  hands  that 
focuses  on  the  commonalities  rather  than  the  differ¬ 
ences,  we  propose  to  achieve  mutually  beneficial 
advances  in  both  fields.  Specifically,  we  propose  to 
carry  out  a  program  of  research  with  the  following 
related  thrusts.  We  will: 

1 .  Construct  a  high  precision  "glass  smooth"  robot 
which  will  serve  as  the  core  of  both  human  and 
robotic  experiments.  This  robot  will  have  an 
integrated  design  of  tactile  sensing  and 
mechanical  subsystems.  Initially,  the  sensors 
will  include  force  and  motion  sensors;  subse¬ 
quently,  we  will  incorporate  a  newly  developed 
tactile  array  using  micromechanics  techniques. 

2.  Carry  out  a  suite  of  experiments  aimed  at  (1) 
determining  the  abilities  and  limitations  of  the 
sensorimotor  functions  of  human  and  robot 
hands  and  (2)  understanding  the  mechanisms 
and  information  representations  they  employ. 
We  propose  to  design  and  fabricate  several 
experimental  stations  based  on  robotic  tech¬ 
nology  for  conducting  a  wide  range  of 


biomechanical,  neurophysiological,  and  psycho¬ 
physical  experiments  on  human  and  animal 
tactual  systems  as  well  as  measurements  on 
robotic  prototypes. 

3.  Develop  methods  applicable  to  both  humans 
and  robots  for  integrating  tactual  data  from 
multiple  types  of  sensors,  multiple  spatially 
separated  sensors,  and  sequences  of  multiple 
contacts  to  achieve  a  consistent  perception  of 
the  external  environment.  We  will  explore 
methods  for  using  this  integrated  information 
for  carrying  out  exploratory  and  manipulation 
tasks  as  well  as  inferring  object  structure  and 
function. 

4.  Articulate  design  criteria  for  robot  sy.  ms  to 
be  employed  in  tactual  tasks.  We  will  test 
these  criteria  in  a  number  of  robotic  testbeds. 

During  the  past  few  months,  a  two  degree  of 
freedom  robot  capable  of  motion  in  a  plane  has 
been  fabricated  and  assembled  and  is  currently 
being  tested  for  use  as  a  tactile  stimulator.  The  ori¬ 
ginal  version  of  the  robot  was  designed  by  Pro¬ 
fessor  R.  Howe  of  Harvard  University  and  minor 
modifications  were  made  to  enhance  its  use  as  a 
tactile  stimulator.  The  stimulator  can  be  operated 
under  both  position  and  force  control.  It  will  be  used 
to  conduct  a  variety  of  biomechanical  and  psycho¬ 
physical  experiments  to  investigate  human  tactile 
sense. 

In  collaboration  with  Dr.  J.K.  Salisbury  of  MIT  and 
his  students,  the  first  generation  design  of  an 
Instrumented  Screwdriver  (ISD)  has  been  com¬ 
pleted,  and  its  fabrication  is  undenway.  This  device 
will  simulate  the  task  of  using  a  screwdriver  in  order 
to  test  the  force  control  capability  of  humans.  The 
"target  screw"  is  simulated  by  a  computer-controlled 
magnetic  particle  brake.  For  example,  during  the 
performance  of  force  tracking  tasks  by  a  subject, 
the  force  vector  under  each  of  two  or  three 
fingerpads  can  be  measured  by  custom-designed 
six-axis  force  sensors.  The  subject  will  perform  the 
tasks  using  active  touch  with  his  or  her  fingerpads 
under  both  normal  and  locally  anesthetized  condi¬ 
tions  to  isolate  the  sources  of  tactual  information. 
We  have  completed  a  set  of  similar  force  tracking 
experiments  using  a  stationary  six-axis  force  sensor 
and  the  data  is  being  analyzed.  Preliminary  results 
show  that  local  anesthesia,  which  blocks  tactile 
information  from  cutaneous  mechanoreceptors. 


58  m.A.  Srinivasan,  sections  on  haptic  perception  and  haptic  interfaces,  in  Research  Directions  in  Virtual  Environments:  Report  of  an 
NSF  Invitational  Workshop,  Computer  Graphics,  ed.  G.  Bishop  et  al.  26(3):  153-177  (1992);  M.A.  Srinivasan  and  J.K.  Salisbury, 
sections  on  haptics,  in  Virtual  Environment  Technology  for  Training,  BBN  Report  No.  7661,  Prepared  by  the  Virtual  Environment  and 
Teleoperator  Research  Consortium  (VETREC)  affiliated  with  MIT.  1992. 
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does  not  seriously  impair  the  performance  of  the 
subjects.  This  implies  that  among  the  tactile  and 
kinesthetic  sources  of  force  information,  humans 
may  be  relying  on  the  kinesthetic  channel  for  force 
control,  even  though  the  tactile  channel  has  better 
sensory  resolution. 
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1.1  Introduction 

Research  on  the  auditory  system  is  carried  out  in 
cooperation  with  two  laboratories  at  the  Massachu¬ 
setts  Eye  and  Ear  Infirmary  (MEEI).  Investigations 
of  signal  transmission  in  the  auditory  system  involve 
the  Eaton-Peabody  Laboratory  for  Auditory  Physi¬ 
ology,  whose  long-term  objective  is  to  determine 
the  anatomical  structures  and  physiological  mech¬ 
anisms  that  underlie  vertebrate  hearing  and  to 
apply  this  knowledge  to  clinical  problems.  Studies 
of  cochlear  implants  in  humans  are  carried  out  at 
the  MEEI  Cochlear  Implant  Research  Laboratory. 
The  ultimate  goal  for  these  devices  is  to  provide 
speech  communicati  n  for  the  deaf.  Cochlear 
implants  electrically  stimulate  intracochlear  elec¬ 
trodes  to  elicit  patterns  of  auditory  nerve  fiber 
activity  that  the  brain  can  learn  to  interpret. 


1.2  Signal  Transmission  in  the  External 
and  Middle  Ear 

The  goal  of  this  work  is  to  understand  the  relation¬ 
ship  between  the  structure  and  function  of  the 
external  and  middle  ear. 


1.2.1  Structure-Function  Relations  in 
Middle  Ears 

Sponsors 

National  Institutes  of  Health 
Grant  R01 -DC-001 94 

Project  Staff 

Dr.  John  J.  Rosowski,  Professor  William  T.  Peake, 
David  A.  Steffens 

In  this  project,  we  have  determined  how  structural 
variations  in  the  middle-  and  external  ears  of 
various  vertebrate  species  contribute  to  differences 
in  auditory  function.  Our  work  in  this  area  has 
taken  two  interrelated  paths.  One  path  has  been  to 
make  acoustic  measurements  of  middle-  and 
external-ear  function  in  the  gerbil,  a  small  mammal 
with  a  highly  specialized  middle-ear  air  space  and 
life  style.'  Measurements  of  middle-ear  input 
impedance  were  made  before  and  after  manipu¬ 
lations  of  the  middle-ear  air  space  (figure  1).  The 
impedance  measured  with  the  air  spaces  sealed, 
vented,  partially  opened  and  widely  opened  are  all 
consistent  with  a  model  in  which  the  impedance  of 


'  M.E.  Ravicz,  J.J.  Rosowski  and  H.F.  Voigt,  "Sound-power  Collection  by  the  Auditory  Periphery  of  the  Mongolian  Gerbil  Meriones 
unguiculatus.  I:  Middle-ear  Input  Impedance."  J.  Acoust  Soc.  Am.  92: 157-177  (1992). 
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the  middle-ear  air  spaces  is  in  series  with  the  rest 
of  the  middle-ear  (figure  2).  Furthermore,  the  data 
and  model  analysis  indicate  that  the  air-space 
impedance  is  primarily  reactive  except  at  the  lowest 
frequencies.  When  extrapolated  to  human  ears, 
these  results  can  be  used  to  predict  the  effects  of 
certain  middle-ear  pathologies. 

The  second  method  we  use  to  investigate 
middle-ear  structure  and  function  is  comparison  of 
the  dimensions  of  various  middle-ear  structures  in 
mammals  with  known  auditory  capabilities.  The 
results  of  this  analysis  illustrate  that  mammals  with 
good  hearing  sensitivity  at  frequencies  less  than  1 
kHz  tend  to  have  larger  middle-ear  dimensions 
while  those  species  with  sensitive  hearing  restricted 
to  the  high  and  ultrasonic  frequencies  (f  >  10  kHz) 
have  smaller  middle-ear  dimensions.  The  results  of 
this  analysis  have  been  used  to  predict  the  auditory 
capabilities  of  very  early  fossil  mammals.^  These 
predictions  suggest  that  the  small  shrew-like 
mammals  that  existed  at  the  reptile-mammal  transi¬ 
tion  200  million  years  ago  were  not  sensitive  to  low- 
frequency  sounds  as  modern  reptiles  are  but,  like 
modern  shrews  and  rats,  were  most  sensitive  to 
high-frequency  sounds. 


1.2.2  Basic  and  Clinical  Studies  of 
Middle-Ear  Function 

Sponsor 

National  Institutes  of  Health 
Contract  P01-DC-00119 
Fellowship  F32-DC00073 

Project  Staff 

Dr.  John  J.  Rosowski,  Professor  William  T.  Peake, 
Dr.  Sunil  Puria 

Our  goal  is  to  understand  the  relationship  between 
the  structure  and  function  of  auditory  periphery 
including  definition  of  the  effect  of  middle-  and 
external-ear  pathologies  on  auditory  function.  To 
help  achieve  this  goal,  we  are  measuring  function  in 
human  ears  using  temporal-bones  donated  at  the 
time  of  death,  and  we  have  applied  models  from 
our  animal  work  to  questions  concerning  middle-ear 
pathologies  and  treatments. 


Figure  1.  Measurements  of  middle-ear  input  impedance 
in  a  gerbil  ear  before  and  after  various  manipulations  of 
the  middle-ear  air  spaces:  sealed,  the  air  spaces  are 
closed;  vented,  a  long-narrow  tube  is  placed  through  the 
bony  walls  of  the  air  space,  the  tube  vents  any  excess 
pressure  in  the  air  space  to  the  atmosphere;  hole  open,  a 
small  2  mm  diameter,  hole  is  made  in  the  air  space  walls; 
wide  open,  the  bony  walls  of  the  air  space  are  widely 
opened. 

In  conjunction  with  Dr.  Saumil  Merchant,  a  physi¬ 
cian  member  of  the  Otolaryngology  Department  at 
the  Massachusetts  Eye  and  Ear  Infirmary,  measure¬ 
ments  were  made  of  the  cochlear  input  impedance 
(the  acoustic  load  on  the  middle  ear)  in  seven  fresh 
and  thawed  temporal-bone  specimens.  Preliminary 
results  of  these  measurements^  suggest  that  the 
cochlear  impedance  is  loosely  approximated  by  a 
series  combination  of  a  cochlear  resistance  and  the 
stiffness  of  the  oval  window  membrane  (annular 
ligament)  at  the  entrance  to  the  inner  ear,  which 
has  been  observed  in  animal  ears.  However,  these 
measurements  also  indicate  a  resistive  component 
to  the  annular-ligament  impedance  which  has  not 
been  seen  in  animal  work.  This  work  is  being 
expanded  to  include  measurements  of  the 


2  J.J.  Rosowski,  "Hearing  in  Transitional  Mammals:  Predictions  from  the  Middle-ear  Anatomy  and  Hearing  Capabilities  of  Extant 
Mammals,"  in  The  Evolutionary  Biology  of  Hearing,  eds.  D.B.Webster,  R.R.  Fay  and  A.N.  Popper  (New  York:  Springer-Verlag,  1992) 
pp.  615-631. 

3  S.N.  Merchant,  M.E.  Ravicz  and  J.J.  Rosowski,  "The  Acoustic  Impedance  of  the  Stapes  and  Cochlea  in  Human  Temporal  Bones," 
Abstracts  of  the  15th  Mid-Winter  Research  Meeting  of  the  Association  for  Research  in  Otolaryngology,  pg.  98  (1992). 
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Figure  2.  (a)  The  series  model  of  the  middle  ear, 

relating  the  middle-ear  input  impedance  ZT  to  the 
impedance  of  the  air  spaces  ZMEC,  and  the  rest  of  the 
middle  ear  ZTOC.  (b)  Model  interpretations  of  the  effect 
of  sealing  the  air  spaces.  Moving  the  switch  on  the  right 
to  each  of  the  four  possible  positions  models:  a  sealed 
middle-ear  air  space  S,  venting  V,  opening  a  small  hole 
H.  and  widely  opening  O.  ZV  is  the  impedance  of  the 
vent  tube  and  the  series  combination  of  MH,  and  RH  is 
the  acoustic  impedance  of  the  small  hole. 

middle-ear  transfer  function  in  isolated  temporal 
bones. 

Measurements  and  models  from  our  animal  work 
were  used  to  produce  a  model  (figure  3)  of  the 
effect  of  direct  acoustic  stimulation  of  the  cochlear 
windows,  which  bypass  the  tympanic  membrane 
and  ossicles  in  normal  and  pathological  human 
ears."*  Analysis  of  the  model  demonstrates  that  the 
alternate  "acoustic"  pathway  for  sound  entry  to  the 
inner  ear  is  not  important  in  normal  auditory  func¬ 
tion  but  may  be  a  major  component  of  that  function 
in  pathological  ears  in  which  the  normal  conduction 
mechanism  is  interrupted.  Comparisons  of  the 
model's  predictions  of  hearing  function  in  patholog¬ 
ical  cases  with  patient  audiograms  support  the 
hypothesized  role  of  direct-acoustic  stimulation  in 
pathological  ears  (figure  4). 


Figure  3.  A  model  of  the  combination  of  "ossicular"  and 
"acoustic"  middle-ear  sound  transmission.  The  ossicular 
path  models  motions  of  the  tympanic  membrane  and 
ossicles  that  produce  a  pressure  on  the  stapes  and 
cochlea  PIS.  The  effective  stimulus  from  the  "acoustic" 
path  is  the  sound  pressure  difference  acting  on  the  inner 
ear  windows  PWD. 

1.3  Cochlear  Mechanisms 

Sponsor 

National  Institutes  of  Health 
Contract  P01-DC00119 
Grant  R01-DC00238 
Grant  R01-DC00473 

Project  Staff 

Professor  Thomas  F.  Weiss,  Dr.  Dennis  M. 
Freeman,  Lisa  F.  Shatz,  Charles  Q.  Davis,  Charles 
C.  Abnet,  Henry  E.  Chung 

Our  goal  is  to  study  the  mechanisms  by  which 
motions  of  macroscopic  structures  in  the  inner  ear 
produce  motions  of  the  mechanically  sensitive  hair 
bundles  of  sensory  receptor  (hair)  cells.  Because 
of  its  strategic  location,  the  tectorial  membrane 
must  play  an  important  role  in  the  mechanical  stim¬ 
ulation  of  hair  bundles.  However,  there  have  been 
few  direct  observations  of  the  tectorial  membrane, 
and  its  critical  properties  remain  obscure.  During 
the  past  year,  we  have  continued  our  efforts  to 
measure  the  physicochemical  properties  of  the 
tectorial  membrane. 

Changes  in  the  ionic  composition  of  the  bath  induce 
changes  in  the  volume  of  the  isolated  tectorial 


*  W.T.  Peake,  J.J.  Rosowski,  and  T.J.  Lynch, 
Hear.  Res.  57:  245-268  (1992). 


"Middle-ear  Transmission:  Acoustic  Versus  Ossicular  Coupling  in  Cat  and  Human,’ 
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Figure  4.  Comparisons  of  the  predictions  from  the  model 
of  figure  3  and  measured  audiograms  for  cases  of:  (a) 
interrupted  incudo-stapedial  joint,  (b)  missing  tympanic 
membrane,  malleus  and  incus,  and  (c)  direct  acoustic 
stimulation  of  the  oval  window  only,  a  condition  that 
mimics  a  Type  IV  tympanoplastic  procedure  to  recon¬ 
struct  the  middle  ear  after  severe  middle-ear  disease. 
The  dashed  lines  illustrate  normal  hearing  levels. 
Hearing  losses  are  illustrated  by  increased  hearing  levels. 

membrane  of  the  chick.®  The  volume  changes  can 
be  large,  often  increasing  the  thickness  of  the 
tectorial  membrane  by  more  than  a  factor  of  two, 
and  have  a  slow  time  course,  often  continuing  for 
hours.  Although  many  of  the  changes  are  reversed 


when  the  original  bath  composition  is  restored,  per¬ 
manent  changes  in  the  structure  of  the  tectorial 
membrane  can  result  from  long  exposures  to  high- 
sodium,  low-calcium  solutions. 

During  the  past  year,  we  have  studied  the  effects  of 
brief  (15  minute)  exposures  to  test  solutions.  We 
have  found  (1)  that  responses  to  brief  exposures 
are  consistent  with  those  for  long  exposures  and  (2) 
that  the  effects  of  brief  exposures  are  more  nearly 
reversible.  Therefore,  we  have  been  able  to 
measure  reponses  to  brief  exposures  of  as  many 
as  20  different  ionic  solutions  in  a  single  prepara¬ 
tion.  Results  show  that  even  very  low  levels  of 
calcium  (<  10  |imol/L)  significantly  change  the 
structure  of  the  tectorial  membrane. 

Because  of  the  considerable  diversity  in  tectorial 
membrane  structure  across  species,  it  is  important 
to  compare  results  obtained  for  chick  tectorial  mem¬ 
brane  with  results  from  other  species.  During  the 
past  year,  we  have  initiated  studies  of 
physicochemical  properties  of  the  tectorial  mem¬ 
brane  of  the  alligator  lizard — a  species  in  which  the 
importance  of  mechanical  processes  at  the  level  of 
individual  hair  bundles  has  been  directly  demon¬ 
strated.  Initial  results  suggest  that  the  tectorial 
membrane  of  the  lizard  (1)  swells  when  high- 
potassium,  low-calcium  solutions  are  replaced  by 
high-sodium,  low-calcium  solutions  and  (2)  shrinks 
when  calcium  is  increased  from  nmol/L  to  mmol/L 
concentrations.  These  results  are  qualitatively 
similar  to  results  with  chick  tectorial  membrane. 
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We  have  been  investigating  the  correspondence 
between  human  auditory  psychophysics  and  neural 
activity  patterns  at  the  level  of  the  auditory  nerve 
and  the  cochlear  nucleus.  We  want  to  gain  insight 
into  what  kinds  of  information  the  auditory  system 
uses  to  represent  complex  acoustic  stimuli.  We 
have  recorded  from  representative  samples  of 
auditory  nerve  fibers  in  anesthetized  cats  and  com¬ 
puted  autocorrelation  histograms  to  estimate  the 
aggregate  interspike  interval  distribution  over  the 
entire  auditory  nerve.  Our  results  for  a  wide  range 
of  complex  stimuli  suggest  that  the  pitch  heard  by 
human  listeners  corresponds  to  the  most  frequent 
interspike  interval  in  the  auditory  nerve. 

In  the  past  year,  we  ran  a  physiological  analog  of  a 
classical  psychophysical  experiment  in  which  ail 
three  components  of  a  harmonic  complex  are 
shifted  in  frequency  by  the  same  amount,  resulting 
in  a  shift  in  perceived  pitch.  Our  results  showed 
that  the  pitch  shifts  estimated  from  the  interspike 
interval  distributions  closely  followed  those 
observed  in  psychophysical  studies.  We  also 
carried  out  a  physiological  analog  of  a  pitch 
dominance  experiment  in  which  two  harmonic  com¬ 
plexes  (harmonics  3-5  versus  harmonics  6-12)  with 
slightly  different  fundamental  frequencies  (FO)  were 
presented  together.  At  FOs  above  150  Hz,  the 
most  frequent  interspike  interval  in  the  auditory 
nerve  always  corresponded  to  the  pitch  associated 
with  harmonics  3-5  rather  than  the  pitch  of  har¬ 
monics  6-12,  consistent  with  human  psychophysical 
data.  These  results  show  that  models  of  pitch  per¬ 
ception  based  on  aggregate  interspike  interval  dis¬ 
tributions  can  account  for  classical  experimental 
results  for  pitch  shift  and  pitch  dominance. 

Psychophysical  experiments  show  that  listeners  can 
identify  better  both  constituents  of  a  pair  of 
simultaneously-presented  vowels  if  the  pitches  of 
the  two  vowels  are  slightly  different.  In  a  physiolog¬ 
ical  experiment  motivated  by  this  result,  we  have 
found  that  (1)  each  single  vowel  evokes  a  charac¬ 
teristic  aggregate  interspike  interval  distribution,  (2) 
one  single  vowel  constituent  of  a  double  vowel  pair 
can  always  be  identified  from  correlations  between 
the  double  vowel  pattern  and  the  various  single 
vowel  patterns,  (3)  the  second  single  vowel  constit¬ 
uent  can  often  be  identified  if  the  first  correctly  iden¬ 
tified  vowel  is  "subtracted"  from  the  double  vowel 
pattern  (via  partial  correlation),  and  (4)  vowel  iden¬ 
tification  through  this  method  improves  when  the 
voice  pitches  are  made  slightly  different.  These 
results  suggest  that  vowels  can  be  discriminated 
using  aggregate  temporal  information  alone  and 


that  explicit  pitch  identification  may  not  be  neces¬ 
sary  to  account  for  improved  identification  of  con¬ 
stituent  vowels  with  increasing  voice  pitch 
differences. 

When  two  sounds  with  different  pitches  are  played 
simultaneously,  certain  pitch  ratios  appear  to  be 
pleasant  or  consonant,  while  others  appear  harsh 
or  dissonant.  In  order  to  investigate  neural  corre¬ 
lates  of  these  phenomena,  we  have  recorded  the 
responses  of  auditory-nerve  to  stimuli  associated 
with  different  degrees  of  consonance  and 
dissonance.  Response  patterns  to  pairs  of  pure 
tones  forming  dissonant  intervals  showed  beating 
for  all  responding  fibers.  No  beating  patterns  were 
found  for  pure-tone  consonant  intervals.  Beating 
patterns  were  also  found  for  dissonant  pairs  of 
complex  tones,  but  were  restricted  to  fibers  whose 
characteristic  frequencies  were  close  to  two  partials 
of  the  complex  tones.  These  beating  patterns  in 
the  frequency  range  20-200  Hz  are  thought  to  be 
physiological  correlates  of  roughness.  A  quantita¬ 
tive  model  that  sums  the  physiologic  roughness 
across  the  array  of  auditory-nerve  fibers  can  predict 
the  relative  dissonances  of  the  different  musical 
intervals.  These  results  suggest  that  certain 
aspects  of  harmony  perception  are  encoded  in  the 
temporal  patterns  of  discharge  of  auditory-nerve 
fibers. 
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1.4  Electrical  Stimulation  of  the 
Auditory  Nerve 
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The  goal  of  these  studies  is  to  understand  the 
mechanisms  of  electrical  stimulation  of  the  auditory 
nerve  to  design  better  cochlear  implants.  For  this 
purpose,  we  record  from  auditory-nerve  fibers  in 
anesthetized  cats  in  response  to  electric  stimuli 
applied  through  electrodes  similar  to  those  used  in 
humans.  In  the  past  year,  we  have  measured  the 
responses  to  pairs  of  brief  monophasic  pulses  sep¬ 
arated  by  short  intervals.  This  stimulus  is  the  sim¬ 
plest  configuration  that  can  show  some  of  the 
interpulse  interactions  that  are  likely  to  occur  in 
interleaved  sampling  speech  processors.  For 
suprathreshold  first  pulses,  the  threshold  for  the 
second  pulse  was  found  to  decrease  with 
increasing  interpulse  delay,  as  expected  from  the 
refractory  properties  of  nerve  fibers.  Interpulse 
delays  at  thresholds  were  consistently  shorter  for  a 
cathodal  first  pulse  than  for  an  anodal  first  pulse 
with  the  same  amplitude.  (The  second  pulse  was 
always  anodal.)  This  latency  difference  can  be 
interpreted  as  the  spike  conduction  time  from  a 
peripherally-located  cathodal  site  of  excitation  to  the 
more  central  anodal  site  of  excitation.  Consistent 
with  this  interpretation,  the  difference  in  latencies 
between  cathodal  and  anodal  single  pulses  was 
approximately  equal  to  the  conduction  time  esti¬ 
mated  from  the  two-pulse  experiment.  We  plan  to 
pursue  these  experiments  to  identify  the  different 
sites  of  nerve  excitation  for  various  stimulus  condi¬ 
tions. 

Another  series  of  experiments  examined  whether 
responses  to  two-pulse  stimuli  can  be  used  to 
predict  responses  for  more  complex  stimulus  condi¬ 
tions.  Three  pulses  of  equal  amplitude  were  pre¬ 
sented  in  rapid  succession.  The  pulse  amplitude 
was  chosen  so  that  each  pulse  would,  by  itself,  be 
well  above  threshold.  The  delay  between  Pulses  1 
and  2  was  adjusted  so  that  Pulse  2  would  just 
produce  a  spike  with  high  probability,  and  the  delay 
between  Pulses  2  and  3  was  adjusted  so  that  Pulse 
3  would  meet  the  same  criterion.  The  delay 
between  Pulses  2  and  3  was  found  to  be  consist¬ 
ently  longer  than  the  delay  between  Pulses  1  and 


2.  This  result  shows  that  the  refractory  period 
cannot  be  seen  as  a  fixed  characteristic  of  a  neuron 
because  it  depends  on  past  history  of  stimulation. 
Model  simulations  show  that  the  magnitude  of  the 
changes  in  refractory  period  are  severely  underesti¬ 
mated  by  classical  Hodgkin-Huxley  dynamics.  This 
result  has  implications  for  the  design  of  interleaved 
sampling  speech  processors  in  which  one  attempts 
to  minimize  interactions  between  successive  pulses. 
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1.5.1  Basic  and  Clinical  Studies  of  the 
Auditory  System 

Our  aim  is  to  determine  the  actions  and  interactions 
of  the  acoustically  elicited  middle-ear  muscle 
reflexes  and  the  olivocochlear  efferent  reflexes.^ 

As  a  first  step  in  our  project  to  measure  the  sound- 
level  dependence  of  the  middle-ear  acoustic 
reflexes  with  and  without  activation  of  medial 
olivocochlear  (MOC)  efferents,  we  have  begun 
studying  MOC  effects  on  brain  stem  evoked  poten¬ 
tials  (BAEPs).  In  anesthetized  cats,  we  measured 
BAEPs  at  the  vertex  in  response  to  clicks  at  a 
variety  of  sound  levels,  with  and  without  activation 
of  MOC  efferents  and  before  and  after  the  efferents 
were  cut  near  the  vestibulo-cochlear  anastomosis. 
This  cut  severs  the  axons  to  the  cochlea  but  leaves 
the  collaterals  to  the  cochlear  nucleus  intact. 

Before  the  cuts,  stimulation  of  MOC  efferents  with 
an  electrode  at  the  midline  of  the  floor  of  the  fourth 
ventricle  shifted  the  BAEP  responses  to  higher 
sound  levels  by  approximately  the  same  amount  as 
auditory-nerve  responses  (N1)  were  shifted  to 
higher  sound  levels.  After  the  cuts,  no  changes  in 
BAEPs  were  produced  by  MOC  stimulation.  Thus, 
from  experiments  to  date,  the  effects  of  efferent 
stimulation  on  BAEPs  appear  to  be  due  solely  to 
MOC  inhibition  of  cochlear  responses  with  no  effect 
attributable  to  the  MOC  collaterals  to  the  cochlear 


7  This  project  is  the  direct  successor  of  the  1991  project;  Middle-Ear  Muscle  Reflex. 
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nucleus.  The  lack  of  effect  of  MOC  collaterals  to 
the  cochlear  nucleus  on  BAEPs  is  consistent  with 
these  collaterals  not  innervating  bushy  cells  in  the 
cochlear  nucleus®  and  the  hypothesis  that  bushy 
cells  are  the  dominant  cochlear-nucleus  cell  type 
involved  in  producing  BAEPs.®  Additionally,  the 
absence  of  changes  in  BAEPs  due  to  severing 
efferents  near  the  vestibulo-cochlear  anastomosis 
means  that  measurements  of  BAEPs  can  serve  as 
a  good  control  for  the  integrity  of  the  auditory 
brainstem  in  experiments  on  middle-ear  muscle 
reflexes  involving  such  efferent  cuts. 

During  the  past  year,  final  manuscript  revision  was 
done  for  publication  of  our  re.sults  on  the  sound- 
frequency  selectivity  of  single  stapedius 
motoneurons  in  cats  which  has  now  appeared  in 
the  Journal  of  Neurophysiology. 
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1.6.1  Olivocochlear  Efferent  Systems 

Our  aim  is  to  understand  the  physiological  effects 
produced  by  medial  olivocochlear  efferents  which 
terminate  on  outer  hair  cells  in  the  mammalian 
cochlea. 

During  the  past  year,  we  made  a  serendipitous  dis¬ 
covery  that  some  vestibular  afferent  fibers  respond 
to  sound  at  high  sound  levels.  An  earlier  study  on 
the  responses  of  mammalian  vestibular  fibers  to 
sound  stimuli’ '  concluded  that  sound  does  not 
increase  the  firing  rate  of  vestibular  fibers  until 
sound  levels  of  120  dB  SPL  or  more  which  is 
outside  of  the  normal  range  of  hearing.  That  work 
led  to  the  widespread  assumption  that  the  auditory 
responses  of  vestibular  fibers  play  no  role  in 
hearing.  The  fibers  we  have  found  respond  to  90 
dB  SPL  tones.  This  is  a  high  sound  level,  but  is 
not  outside  the  range  of  every-day  hearing.  (It  is 
approximately  the  level  of  a  loud  shout  or  the  level 
at  which  middle-ear  muscles  begin  to  contract.) 
Our  result,  coupled  with  the  finding  of  Kevetter  et 
al.’®  that  showed  that  some  vestibular  fibers  in 
gerbils  innervate  both  vestibular  brain-stem  nuclei 
and  parts  of  the  cochlear  nucleus,  suggests  that  the 
vestibular-auditory  fibers  which  we  have  discovered 
might  play  a  role  in  hearing  at  high  sound  levels. 
We  recorded  from  single  units  in  the  inferior 
vestibular  nerve  of  barbiturate-anesthetized  cats. 
The  vestibular-auditory  fibers  (1)  respond  to  sound 
with  a  latency  of  2-4  ms.,  (2)  have  broad  tuning 
curves  with  CFs  around  1  kHz,  (3)  have  low  spon¬ 
taneous  firing  rates  (0-5/sec),  (4)  have  increased 
firing  in  response  to  efferent  shocks  (like  other 
vestibular  fibers  but  opposite  to  the  effect  of 
efferents  on  cochlear-nerve  fibers),  and  (5)  have 
bipolar  ganglion  cells  in  the  inferior  division  of 
Scarpa's  ganglion  with  peripheral  processes 
extending  into  the  saccular  nerve  and  central  pro¬ 
cesses  entering  the  brain  stem  in  the  vestibular 
nen/e  root  (from  single-fiber  labeling  of  four 
neurons).  In  many  lower  vertebrates  the  saccule  is 
both  a  hearing  organ  and  a  vestibular  organ,  and  it 
may  serve  as  both  in  mammals.  A  preliminary 
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version  of  this  work  will  be  presented  early  in 

1993.’3 
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The  basic  function  of  a  cochlear  prosthesis  is  to 
elicit  patterns  of  activity  on  the  array  of  surviving 
auditory  nerve  fibers  by  stimulating  electrodes  that 
are  placed  in  and/or  around  the  cochlea.  By  modu¬ 
lating  these  patterns  of  neural  activity,  these 
devices  attempt  to  present  information  that  the 
implanted  subject  can  learn  to  interpret.  The  spike 
activity  patterns  elicited  by  electrical  stimulation 
depend  on  several  factors:  the  complex,  electrically 
heterogeneous  structure  of  the  cochlea,  the  geom¬ 
etry  and  placement  of  the  stimulating  electrodes, 
the  stimulus  waveform,  and  the  distribution  of 
excitable  auditory  nerve  fibers.  An  understanding 
of  how  these  factors  interact  to  determine  the 
activity  patterns  is  fundamental  to  designing  better 
devices  and  to  interpreting  the  results  of  exper¬ 
iments  involving  intracochlear  stimulation  of  animal 
and  human  subjects.  As  a  first  step  towards  this 
understanding,  the  goal  of  this  project  is  to  con¬ 
struct  a  software  model  of  the  cochlea  that  predicts 
the  distribution  of  potential  produced  by  the  stimu¬ 


lation  of  arbitrarily  placed,  intracochlear  electrodes 
and  use  these  potential  distributions  as  inputs  that 
drive  models  of  auditory  nerve  fibers. 

This  year  we  continued  the  development  of  the 
three-dimensional,  finite  difference  model  of  the 
human  cochlea  that  predicts  the  potential  distribu¬ 
tion  produced  in  this  structure  by  electrical  stimu¬ 
lation  using  model  electrodes  of  arbitrary  position 
and  geometry.  In  order  to  make  the  use  of  this 
model  more  practical  by  reducing  the  time  to 
compute  a  solution,  the  model  was  implemented  on 
a  SUN  SPARCstation-2  where  solutions  are  com¬ 
puted  in  twenty  minutes  rather  than  the  eight  hours 
required  by  the  original  VAX-750  system. 

A  series  of  solutions  were  computed  to  test  the 
model’s  sensitivity  to  variations  in  the  resistivity  of 
the  major  cochlear  components  (e.g.,  perilymph, 
endolymph,  bone,  nerve  and  basilar  membrane). 
These  solutions  documented  the  stability  of  the 
model  and  confirmed  our  intuition  that  the  asym¬ 
metric  potential  distributions  predicted  by  the  model 
and  reported  in  previous  years  reflect  the  basic, 
three-dimensional  anatomy  of  this  structure. 

We  are  in  the  final  stages  of  the  morphological 
work  on  one  rat  ear  that  will  represent  the  anatom¬ 
ical  component  of  the  electro-anatomical  model  for 
this  animal.  From  registered  serial  sections  of  the 
cochlea-  we  are  currently  generating  the  database 
through  which  the  tissues  and  their  electrical  resist¬ 
ances  are  represented  in  the  three-dimensional 
model.  Using  the  rat,  we  will  be  able  to  make  alter¬ 
ations  in  the  anatomy  of  the  cochlea  and  make 
measurements  that  would  je  impossible  in  our 
human  subjects.  This  will  provide  an  important  tool 
in  the  future  testing  and  refinement  of  this 
electroanatomical  model. 

Continued  development  of  a  single-fiber  computer 
model  of  myelinated  excitation  by  electrical  stimu¬ 
lation  has  led  us  to  model  formulations  that  more 
accurately  predict  the  following  measurements  in 
the  cat:  (1)  strength-duration  time  constants  for 
cathodal  stimulation  of  260  us  instead  of  the  80  us 
predictea  by  the  classical  model,  (2)  anodal/cath- 
odal  threshold  ratios  between  1  and  4  instead  of 
the  4.9  predicted  by  the  classical  model,  (3)  con¬ 
duction  velocity  4-6  m/sec/micron  that  are  greater 
than  found  in  the  classical  model,  and  (4)  refractory 
periods  that  are  three  times  shorter  than  predicted 
by  the  classical  model.  The  anodal/cathodal 
threshold  predictions  have  also  been  confirmed 
qualitatively  by  measurements  made  in  the  rat. 


'3  M.P.  McCue  and  J.J.  Guinan,  Jr., "  Acoustic  Responses  from  Primary  Afferent  Neurons  of  the  Mammalian  Sacculus,"  Assoc,  for  Res. 
in  Otolaryngol.,  Abstr,  16,  forthcoming. 
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1.7.2  Project  B:  New  Sound  Processors  for 
Auditory  Prostheses 

Project  Staff 

Dr.  Donald  K.  Eddington,  Dr.  William  M.  Rabinowitz, 
Joseph  Tierney,  Marc  A.  Zissman 

Our  collaboration  with  colleagues  at  Duke  Univer¬ 
sity  and  the  Research  Triangle  Institute  in  the 
development  of  new  processing  algorithms  to 
improve  the  speech  reception  of  cochlear  implant 
users  has  resulted  in  laboratory-based  systems  that 
increase  the  average  single-syllable  word  recogni¬ 
tion  scores  by  20  percentage  points  for  the  seven 
subjects  tested.’"'  In  addition  to  increasing  the 
number  of  stimulating  channels  from  four  to  six  and 
implementing  a  different  compression  technique, 
the  new  algorithms  were  designed  to  reduce  inter¬ 
ference  assumed  to  be  caused  by  simultaneously 
activating  intracochlear  electrodes. 

As  a  first  step  in  our  investigation  to  determine 
which  of  the  differences  between  the  new  and  old 
sound  processing  systems  contributed  to  increased 
performance,  we  compared  a  subject's  speech 
reception  ability  using  three  sound  processing 
schemes.  These  schemes  were:  (1)  the  current 
ccmmercial  processor  that  uses  four  bandpass 
filters  to  segment  the  frequency  spectrum  from 
approximately  100  Hz  to  4,000  Hz  and  presents  the 
analogue  outputs  of  these  filters  as  simultaneous 
current  waveforms  to  four  of  the  implanted  elec¬ 
trodes,  (2)  a  4-channel  processor  in  which  the 
envelopes  of  the  four  bandpass  filter  outputs  are 
used  to  modulate  biphasic  pulse  trains  that  are  pre¬ 
sented  as  interleaved  stimuli  to  four  of  the 
implanted  electrodes,  and  (3)  a  6-channel  version 
of  processor  (2).  The  4-channel  and  6-channel 
interleaved  systems  shared  an  amplitude  com¬ 
pression  scheme  in  which  the  top  60  dB  of  the 
envelope  output  range  was  logarithmically  mapped 
into  the  subject's  dynamic  range.  The  commercial, 
4-channel  analogue  system  uses  an  automatic  gain 
control  that  acts  on  the  signal  before  processing  by 
the  bandpass  filters. 

One  subject's  ability  to  identify  closed  sets  of  con¬ 
sonants  (24  consonants  presented  in  a  /aCa/ 


context)  and  vowels  (8  vowels  presented  in  a  /bVt/ 
context)  has  been  measured  for  these  three  pro¬ 
cessing  schemes.  Scores  for  consonant  recogni¬ 
tion  using  the  analogue  and  interleaved,  4-channel 
processors  were  87  percent  and  89  percent  respec¬ 
tively.  While  the  difference  in  consonant  recogni¬ 
tion  is  not  significant  for  the  4-channel  processors, 
these  scores  are  significantly  different  from  the  96 
percent  scored  with  the  6-channel,  interleaved 
processor.  This  implies  that  for  consonant  identifi¬ 
cation,  differences  in  performance  were  due  to 
increasing  the  number  of  channels  but  differences 
in  compression  and  interleaving  were  probably  not 
significant.  The  vowel  scores  were  79  percent  and 
87  percent  for  the  4-channel  systems  (analogue 
and  interleaved  respectively)  and  98  percent  for  the 
6-channel,  interleaved  system.  Differences  in  the 
two  4-channel  system  scores  suggest  that  differ¬ 
ences  in  compression  and/or  interleaving  may  con¬ 
tribute  to  better  vowel  recognition.  As  with  the 
consonant  scores,  increasing  the  number  of  chan¬ 
nels  also  contributes  to  better  vowel  scores. 

Two  different  psychophysical  tests  conducted  with 
implanted  subjects  this  year  are  leading  to  the 
design  of  new  speech  processors  that  will  be  tested 
next  year.  Loudness  scaling  experiments  using 
electrical  stimulation  of  the  subjects'  intracochlear 
electrodes  show  that  the  shape  of  the  loudness 
growth  function  varies  across  electrodes.  These 
results  suggest  customizing  the  function  used  to 
map  channel  energy  to  stimulus  amplitude  for  each 
electrode  rather  than  using  the  same  function  as  is 
currently  done.  Psychophysical  measures  of  inter¬ 
action  during  electrical  stimulation  of  nonsimul- 
taneous  stimuli  at  the  same  or  different  electrodes 
have  revealed  complex  patterns  of  masking  and 
unmasking  that  depend  on  the  amplitude  of  the 
masker  and  the  separation  of  the  masker  and  test 
stimuli  in  both  time  and  space.  These  measures  of 
nonsimultaneous  interactions  have  suggested  new 
stimulus  waveforms  that  may  reduce  these  inter¬ 
actions.  These  waveforms  are  being  tested  in  our 
single-unit  models  and,  in  the  next  year,  will  be 
tested  as  part  of  a  6-channel,  interleaved  sound 
processor. 


Wilson.  B.S.,  C.C.  Finley,  D.T.  Lawson,  R.D.  Wolford,  D.K.  Eddington  and  W.M.  Rabinowitz,  "  Better  Speech  Recognition  with 
Cochlear  Implants,"  Nature  352:  236-238  (1991). 
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Chapter  1.  Linguistics 


Academic  and  Research  Staff 

Professor  Morris  Halle,  Professor  Noam  A.  Chomsky 


1.1  Introduction 

The  work  of  the  Linguistics  group  is  directed 
towards  a  better  understanding  of  the  mental 
capacities  of  human  beings  through  the  study  of  the 
nature,  acquisition  and  use  of  language.  Language 
is  a  uniquely  human  faculty:  only  humans  appear  to 
be  capable  of  learning  and  using  a  language,  and 
every  normal  human  acquires  knowledge  of  one  or 
more  languages. 

We  are  trying  to  understand  how  this  linguistic 
knowledge  is  represented  in  the  speaker’s  mind. 
The  central  issues  of  linguistics  research  are: 

1.  What  is  the  nature  of  linguistic  knowledge? 
What  do  speakers  of  a  particular  language 
such  as  Latvian,  Spanish  or  Walpiri  know,  and 
how  does  knowledge  of  one  language 
resemble  or  differ  from  that  of  another  lan¬ 
guage? 

2.  How  do  speakers  acquire  this  knowledge? 

3.  How  do  speakers  put  this  knowledge  to  use  in 
producing  and  understanding  utterances? 

4.  What  are  the  physiological  mechanisms  that 
provide  the  material  basis  for  storage,  acquisi¬ 
tion  and  utilization  of  linguistic  knowledge? 

Our  ability  to  answer  these  questions  differs  consid¬ 
erably,  and  our  research  reflects  these  differences. 
At  present,  we  have  progressed  further  with  regard 
to  answering  the  questions  posed  by  item  one  and 
have  made  less  progress  with  item  four.  Currently, 
our  research  is  heavily  concentrated  on  issues  con¬ 
cerned  with  the  nature  of  the  knowledge  that  char¬ 
acterizes  fluent  speakers  of  various  languages. 
However,  we  are  making  a  significant  effort  to  solve 
the  other  questions  also. 

We  are  studying  these  topics  along  a  number  of 
parallel  lines.  Linguists  have  investigated  the  prin¬ 
ciples  by  which  words  are  concatenated  to  form 
meaningful  sentences.  These  principles  have  been 
the  primary  domain  of  inquiry  into  the  disciplines  of 
syntax  and  semantics.  Phonology  studies  the 


sound  structure  of  words  while  morphology  exam¬ 
ines  the  manner  in  which  different  languages 
combine  different  meaning-bearing  units  (specif¬ 
ically,  stems,  prefixes,  suffixes  and  infixes)  to  form 
words.  The  latter  topic  has  attracted  increasing 
interest  in  recent  years  and  will  probably  become 
more  prominent  in  our  research  efforts  in  the  future. 


1.2  Abstracts  of  Doctoral  Dissertations 

The  following  are  abstracts  of  dissertations  sub¬ 
mitted  in  1992  to  the  Department  of  Linguistics  and 
Philosophy  in  partial  fulfillment  of  the  requirements 
for  the  degree  of  Doctor  of  Philosophy  in  Linguis¬ 
tics. 


1.2.1  Subjects  and  Complementizers 

Philip  Branigan 

Abstract 

This  thesis  is  concerned  with  the  position  that 
subject  and  objects  occupy  at  LF.  It  is  demon¬ 
strated  that  objects  must  occupy  a  Case  position 
outside  of  VP  at  LF  in  order  to  account  for  the 
effects  of  objects  in  participle  agreement  configura¬ 
tions  and  in  configurations  in  which  they  interact 
with  'dependent'  elements  in  VP  adjuncts.  It  is 
shown  that  subjects  occupy  a  non-Case  position 
located  between  CP  and  IP,  a  position  which  can 
also  be  occupied  by  inverted  locative  phrases  in 
English.  The  A-bar  status  of  this  position  leads  to 
'that-trace'  effects  and  a  greater  sensitivity  to  A-bar 
extraction  of  the  subject  from  inside  an  island. 


1.2.2  Adjunct  Extraction  and  Chain 
Configurations 

Michael  V.  Hegarty 


Chapter  1.  Linguistics 


Abstract 

This  study  is  concerned  with  the  semantic  content 
of  compiementizers,  and  with  the  modes  of  interpre¬ 
tation  of  adverbiai  wh  eiements  and  with  their  possi¬ 
bilities  for  wh  movement.  It  examines  the  event 
structure  of  complement  clauses,  revealing  a  differ¬ 
ence  of  semantic  function  between  the 
complementizers  in  tensed  factive  complement 
clauses  and  propositional  complement  clauses. 
This  yields  a  difference  in  their  effects  on  adverbial 
wh  extraction.  The  effect  of  complementizers  on 
wh  extraction  points  to  the  Minimality  Condition, 
taking  effect  for  nongoverning  heads  such  as 
complementizers.  An  attempt  is  made  to  define 
antecedent  government  in  such  a  way  as  to  capture 
the  Minimality  effects  of  complementizers. 

Chapter  1  reviews  some  pertinent  aspects  of  the 
underlying  syntactic  and  semantic  frameworks,  and 
contains  a  brief  outline  of  the  thesis. 

Chapter  2  examines  the  event  structure  of  comple¬ 
ment  clauses.  It  argues  that  the  event  position  of 
factive  complements  is  discharged  by  the  factive 
complementizer,  yielding  an  interpretation  of  factive 
complements  as  involving  discourse  binding  of  the 
event  position  of  the  complement  clause.  It  then 
examines  the  event  position  of  various  infinitival 
complement  clauses  and  of  gerunds,  as  well  as 
some  other  contexts  which  involve  discourse 
binding  of  an  event  variable,  and  which  exhibit  the 
resulting  presuppositon  that  the  event  occurred.  In 
all  cases,  event  positions  which  are  not  bound  are 
found  to  exhibit  the  possibility  of  unselective  binding 
by  an  adverb  of  quantification. 

Chapter  3  examines  the  pattern  of  adverbial  wh 
extraction  from  fractive  and  propositional  comple¬ 
ment  clauses  and  shows  that  this  pattern  depends 
on  the  event  structures  of  the  complement  clauses 
detailed  in  Chapter  2.  Adverbial  wh  extraction  out 
of  infinitivals  and  gerunds  is  then  examined,  along 
with  extraction  from  some  other  contexts  which 
involve  discourse  binding  of  the  event  variable. 

Chapter  4  examines  some  simplifications  in 
extraction  theory  that  follow  from  a  typology  of 
chains  at  LF  proposed  by  Chomsky.  On  the  basis 
of  these  simplifications,  a  radically  reduced  system 
of  y-marking  is  proposed  for  extraction  theory.  A 
definition  of  antecedent  government  is  then  formu¬ 
lated  so  as  to  capture  the  Minimality  effects  on 
antecedent  government  of  intervening 
complementizers. 


Chapter  5  summarizes  the  results  of  the  thesis  and 
discusses  them  in  the  light  of  some  recent  litera¬ 
ture. 


1.2.3  The  Computation  of  Prosody 

William  J.  Idsardi 

Abstract 

This  thesis  presents  a  new  theory  of  metrical  repre¬ 
sentations  and  computations.  This  theory  empha¬ 
sizes  that  the  metrical  grid  is  a  separate  module  for 
the  phonology,  devoted  to  the  calculation  of  parti¬ 
tioning  of  phonological  elements.  The  metrical  grid 
consists  of  parallel  tiers  composed  of  three  kinds  of 
elements:  grid  marks  and  left  and  right  boundaries. 
A  single  boundary  serves  to  define  a  metrical  con¬ 
stituent:  a  left  boundary  creates  a  grouping  of  the 
elements  to  its  right,  a  right  boundary  creates  a 
grouping  of  the  elements  to  its  left. 

The  calculation  of  the  metrical  grid  is  accomplished 
through  the  use  of  both  rules  and  constraints.  This 
division  of  labor  accounts  for  observed  properties  of 
stress  systems  in  a  succinct  manner.  Metrical  rules 
apply  successively  In  a  derivation,  thus  modelling 
the  functional  character  of  metrical  structure  assign¬ 
ment.  The  constraints  prevent  the  application  of 
metrical  rules  that  would  generate  universal  or 
language-particular  disfavored  configurations. 

The  interface  to  the  metrical  grid  module  is  con¬ 
trolled  by  two  parameters  of  projection,  which 
provide  the  initial  grid  marks  and  boundaries. 
Further  parameterized  rules  of  Edge  marking,  Itera¬ 
tive  constituent  construction  and  Headedness  com¬ 
plete  the  construction  of  the  grid. 

This  theory  allows  the  derivation  of  Extrametrically 
effects  through  the  interaction  between  Edge 
marking  and  Iterative  constituent  construction. 
Constraints  against  particular  configurations  yield 
both  clash  effects  and  a  ternary  parsing  ability. 

Further,  the  Edge  marking  parameter  provides  the 
requisite  formal  power  to  deal  with  stress  intro¬ 
duced  by  specific  morphemes  in  such  languages  as 
Turkish,  Macedonian,  Polish,  Russian,  Cayuvava, 
Shuswap  (Salish)  and  Moses-Columbian  (Salish). 


1.2.4  Japanese  Syntactic  Structures  and 
Their  Constructional  Meanings 

Miori  Kubo 
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Abstract 

This  dissertation  explores  the  sentential  structures 
of  Japanese,  of  both  matrix  and  embedded  clauses, 
with  special  attention  given  to  the  configurational 
relation  between  a  predicate  head  X°  and  its  argu¬ 
ments.  It  is  argued  that  two  quite  distinct  sentential 
forms,  IP  and  CP,  are  possible  in  Japanese.  A 
leading  idea  pursued  here  is  that  crucial  factors 
determining  the  configurational  relations  are  (i) 
whether  a  head  goes  through  syntactic  head  move¬ 
ment  (Travis  1984,  Baker  19S8)  and  (ii)  whether  a 
head  is  syntactically  filled  or  empty  (Emonds  1985). 
Among  the  sentence  types  researched  are  included 
not  only  sentences  with  single  verbs,  but  also  predi¬ 
cate  attribute  sentences,  complex  verb  sentences 
containing  causative,  desiderative  and  potential  suf¬ 
fixes,  and  a  representative  range  of  embedded  sen¬ 
tences. 

In  relation  to  head  movement,  I  argue  for  a  strict 
structural  case  marking  analysis  in  the  spirit  of 
Takezawa  (1987)  and  Morikawa  (1989)  and  show 
that  the  configurational  relations  between  case¬ 
assigning  heads  and  NPs  are  directly  mirrored  by 
morphological  case.  Both  SPEC-HEAD  agreement 
and  government  are  motivated  as  case  assigning 
mechanisms,  the  former  being  reserved,  however, 
for  only  subject-predicate  relations.  A  central  dis¬ 
covery  here  is  that  the  functional  categories  in 
Japanese,  C,  I,  and  0,  each  uniformly  assign  one 
and  only  one  so-called  nominative  case  ga;  this  in 
turn  supports  the  existence  of  functional  categories 
as  a  class. 

Another  effect  of  head  movement,  via  case 
marking,  related  to  possible  positions  for  subjects 
and  their  associated  configurational  meanings. 
That  is,  the  two  well-known  interpretations  of  NP- 
oa.  neutral  and  exhaustive,  are  argued  to  result 
from  them  occupying  different  positions  in  SPEC(I) 
and  SPEC(C).  The  distributional  distinctions 
between  SPEC(C)  and  SPEC(I)  are  in  turn  deter¬ 
mined  by  positions  of  the  corresponding  heads, 
which  are  governed  by  various  syntactic  con¬ 
straints.  In  the  course  of  investigation,  quite  inter¬ 
esting  parallels,  until  now  obscured,  are  brought  out 
between  noun  phrases  and  sentences. 

This  research  can  be  taken  as  a  concrete  effort  to 
investigate  various  structural  configurations  and 
their  configurational  meanings,  on  the  basis  of  the 
idea  that  meaning  is  in  essential  parts  simply  form. 
Among  the  contrasts  investigated  are  activity  versus 
stative  verbs,  exhaustive  versus  neutral  interpreta¬ 
tions  of  NP-  ga^  and  the  differing  syntactic  and 
interpretive  behaviors  of  the  two  clausal  types  CP 
and  IP.  In  a  final  chapter,  I  motivate  a  correlation 


between  these  two  clausal  structures  and  Kant's 
two  types  of  judgements,  analytic  and  synthetic. 


1.2.5  Coordination  and  Comparatives 

Friederike  Moltmann 

Abstract 

This  thesis  deals  with  three  topics,  coordination 
(chapter  1-3),  comparatives  (chapter  4)  and  excep¬ 
tion  phrases  (chapter  5). 

In  the  first  three  chapters,  I  will  present  a  new 
three-dimensional  theory  of  coordination  and  a  par¬ 
ticular  way  of  compositionally  interpreting  three- 
dimensional  syntactic  structures.  The  theory  is 
applied  to  ATB  constructions  in  chapter  1  and  to 
constructions  with  split  antecedents  in  different 
conjuncts  of  coordinate  in  chapter  2.  I  propose  that 
both  ATB  extraction  and  the  constructions  with  split 
antecedents  involve  a  new  construction  type  of 
implicit  coordination.  In  chapter  3,  I  will  discuss  the 
possibility  of  implicit  coordination  in  constructions 
other  than  'and’-coordination  and  ATB  extraction. 

Chapter  4  presents  a  general  syntactic  and 
semantic  analysis  of  comparative  constructions.  In 
particular,  it  presents  answers  to  the  following  three 
questions.  First,  how  do  quantifier  scope  inter¬ 
actions  come  about  in  comparative  clauses  and 
other  wh  constructions.  Second,  how  is  the  empty 
element  in  comparative  deletion  contexts  identified. 
Third,  what  is  the  syntactic  basis  for  evaluating 
comparative  clauses  the  way  they  are  most 
plausibly  evaluated,  namely  as  universal  quantifiers 
over  degree.  In  the  fourth  chapter,  I  will  also 
address  the  issue  of  whether  comparatives  involve 
coordination.  I  will  argue  that  comparative  sen¬ 
tences  may  have  two  syntactic  stnjctures  simul¬ 
taneously,  a  coordinate  structure  and  a  subordinate 
structure.  This  also  holds  for  phrasal  comparatives. 

Chapter  5  deals  with  various  kinds  of  exception 
phrases  and  a  number  of  other  constructions  which 
are  either  semantically  or  syntactically  related  to 
exception  constructions,  namely  'extent  clauses', 
'almost'-phrases  and  amount  relatives.  I  propose  a 
general  semantic  condition  to  account  for  the 
quantifier  constraint  imposed  by  exception  phrases 
and  semantically  related  constructions.  Further¬ 
more,  I  will  argue  that  exception  constructions  may 
involve  polyadic  quantification.  Regarding  the 
syntax  of  exception  phrases,  I  will  discuss  the  issue 
of  whether  exception  constructions  syntactically 
involve  coordination,  but  argue  against  this  possi¬ 
bility. 
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1.2.6  Crossing  and  Nested  Paths:  NP 
Movement  in  Accusative  and  Ergative 
Languages 

Kumiko  Murasugi 

Abstract 

This  thesis  develops  a  theory  of  syntactic  ergativity 
based  on  the  Case  feature  requirements  which 
motivate  NP  movement.  In  Chapter  1, 1  present  an 
Ergative  Parameter  derived  from  the  Case  features 
of  twofunctional  heads,  T(ense)  and  Tr(ansitivity). 
In  an  ergative  language,  the  Case  features  of  the 
higher  T(ense)  are  strong,  requiring  movement  to 
its  SPEC  at  s-structure.  In  an  accusative  language, 
the  Case  features  of  the  lower  Tr(ansitivity)  are 
strong,  forcing  s-structure  movement  to  its  SPEC. 
Economy  Principles  determine  that  it  is  always  the 
subject,  generated  in  SPEC  VP,  which  raises  to  the 
SPEC  of  T  or  Tr  (depending  on  the  language)  at 
s-structure.  At  LF,  the  object  raises  to  the 
remaining  SPEC  position.  This  results  in  two  types 
of  movement:  "Crossing  Paths"  in  accusative  lan¬ 
guages,  and  "Nested  Paths"  in  ergative  languages. 
The  two  paths  reflect  the  different  Case  and  agree¬ 
ment  patterns  found  in  the  two  language  types,  as 
each  functional  head  is  associated  with  a  particular 
Case:  nominative/absolutive  with  T,  and 

accusative/ergative  with  Tr.  In  an  intransitive 
clause,  the  subject  raises  to  SPEC  TP  in  both  types 
of  languages. 

Chapter  2  investigates  consequences  of  the  pro¬ 
posed  theory  for  NP  movement  and  verb  raising  in 
accusative  languages.  The  topics  relating  to  NP 
movement  include  expletives  in  SPEC  TP,  the  EPP 
in  infinitival  clauses,  and  raising  constructions  in 
English.  I  also  discuss  verb  fronting  in  V2  and  VSO 
languages,  proposing  that  this  movement  is 
required  by  a  [finite]  feature  in  C. 

In  chapter  3,  I  provide  evidence  for  my  claim  that 
ergativity  involves  the  raising  of  the  object  (O)  and 
intransitive  subject  (S)  to  SPEC  TP,  the  position  of 
transitive  (A)  and  intransitive  (S)  subjects  in 
accusative  languages.  I  discuss  properties  shared 
by  0/S  in  an  ergative  language,  and  A/S  in  an 
accusative  language,  such  as  the  availability  for 
relativization. 

In  Chapter  4,  I  investigate  syntactic  and  semantic 
properties  which  group  together  S  and  A  in  both 
language  types,  claiming  that  they  involve  the 
notion  of  highest  s-structure  argument,  and  the¬ 
matic  roles.  Chapter  5  discusses  various  issues 
pertaining  to  ergativity. 


1.2.7  Features,  Positions  and  Affixes  in 
Autonomous  Morphological  Structure 

Robert  R.  Noyer 

Abstract 

This  thesis  develops  a  theory  of  the  mapping  from 
the  output  of  syntactic  rules  to  the  input  to 
phonology  within  a  component  of  Morphology. 
Three  types  of  information  are  shown  to  be  inde¬ 
pendently  manipulable  in  the  algorithm  deriving  fully 
inflected  words:  morphosyntactic  features,  phono¬ 
logical  material  (affixes),  and  positions-of- 
exponence. 

To  capture  categorial  neutralizations  at  the  morpho¬ 
logical  level,  a  filter-based  approach  to  the 
Impoverishment  of  morphosyntactic  representations 
is  proposed.  Filters  are  argued  to  be  automatically 
assumed  unless  positive  evidence  suppresses  them 
in  the  learner.  Neutralizations  are  also  shown  to 
occur  at  three  other  levels:  (1)  underspecification  of 
realization  rules/affixes,  (2)  disjunction  of  affixes 
competing  for  a  position-of-exponence,  and  (3) 
transparency  of  redundant  morphosyntactic  values. 
A  hierarchy  of  features  is  proposed,  which,  in  the 
unmarked  instance,  governs  the  direction  of  catego¬ 
rial  neutralization  and  the  ordering  of  affixes  and 
morphological  rules,  where  not  determined  by 
syntax  or  by  the  principle  of  descending  complexity 
due  to  Panini. 

The  mapping  to  phonological  form  is  proposed  to 
take  place  in  two  phases.  In  the  first  phase, 
morphosyntactic  well-formedness  is  established 
through  the  Impoverishment  of  morphosyntactic  fea¬ 
tures  and  the  Linearization,  Rebracketing,  Merger 
(Marantz  1988)  and  Fusion  of  syntactic  heads, 
giving  a  level  of  constituents  termed  M°s.  In  the 
second  phase,  the  string  M°  is  mapped  to  the  well- 
formed  morphological  word  through  rules  and 
affixes.  Morphological  words  are  shown  to  display 
idiosyncratic  non-phonological  conditions  of  well- 
formedness  (obligatory  or  limited  positions-of- 
exponence)  which  comprise  autonomous 
morphological  structure.  Particular  attention  is  paid 
to  cases  of  M°  splitting,  where  features  from  a 
single  morphosyntactic  source  are  realized  at  mul¬ 
tiple  positions. 

Features  in  input  morphosyntactic  representations 
are  argued  to  be  discharged  through  the 
instantiation  of  affixes,  sometimes  at  obligatory 
positions-of-exponence,  which  are  also  then  dis¬ 
charged.  This  discharge  constitutes  spell-out  or 
realization  of  phonological  material.  Because  both 
features  and  positions  may  be  discharged  indepen¬ 
dently,  it  becomes  possible  to  express  disjunction 
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across  position  classes,  termed  here  discontinuous 
bleeding.  Further,  the  Rule-Affix  Hypothesis  states 
that  rules  change  phonological  material  and  do  not 
discharge  morphosyntactic  features;  affixes  add 
phonological  material  and  do  discharge  such  lea- 
tures. 

The  Introduction  situates  the  current  proposal  within 
prevailing  theories  of  inflection,  Lieber  (1980,  1992), 
Anderson  (1981,  1992),  and  Halle  (1989a,  1992). 
Chapter  1  exemplifies  the  theory  through  a  detailed 
analysis  of  the  Afroasiatic  prefix-conjugation. 
Chapter  2  gives  an  explicit  theory  of  person-number 
features  and  their  interpretation,  in  particular  the 
inherent  number  system  of  Kiowa-Tanoan.  Chapter 
3  treats  complex  multiple  argument  agreement 
systems  with  particular  attention  to  the  transitive 
clitic  sequences  of  Nunggubuyu  and  Ket. 


1.2.8  Syllables  and  Templates:  Evidence 
from  Southern  Sierra  Miwok 

Kelly  D.  Sloan 

Abstract 

This  thesis  examines  aspects  of  the  syllable  struc¬ 
ture  and  templatic  phonology  of  Southern  Sierra 
Miwok  (henceforth  SSM).  SSM  provides  strong  evi¬ 
dence  for  a  model  of  phonological  organization 
incorporating  an  x-slot  tier  (a  'segmental  model’).  I 
advance  three  arguments  in  favor  of  a  segmental 
model.  First,  I  argue  that  SSM  exhibits  floating 
x-slots  and  floating  phonemes.  SSM  has  a 
complex  array  of  length  and  alternation  phe¬ 
nomena.  I  demonstrate  that  by  utilizing  singly- 
linked  phonemes,  doubly-linked  phonemes,  floating 
x-slots,  floating  phonemes  and  combinations  of 
these,  all  of  the  surface  forms  can  be  derived  by 
syllabification  without  recourse  to  any  special 
lengthening,  shortening  or  deletion  rules.  I  propose 
a  hierarchy  of  elements  in  which  x-slots  with 
phonemic  content  have  precedence  over  floating 
x-slots  which  in  turn  have  precedence  over  floating 
phonemes. 

Second,  I  show  that  the  behavior  of  morpheme-final 
geminates  in  SSM  is  inconsistent  with  the  morale 
analysis  of  gemination  proposed  in  Hayes  (1989). 
Only  a  model  which  represents  geminates  as 
holding  two  positions  or  'slots'  will  properly  account 
for  the  gemination  facts.  The  third  type  of  evidence 
comes  from  the  templatic  system  of  SSM.  I  dem¬ 
onstrate  that  SSM  exhibits  the  three  distinct  tem¬ 
plates  CVeve,  CVCVX  and  CVCVV.  I  propose 
that  these  can  be  represented  by  taking  advantage 
of  the  distinctions  between  branching  and  non¬ 
branching  Nucleus  and  between  floating  and  non¬ 


floating  x-slots.  These  distinctions  readily  follow 
from  the  x-slot  model  I  advance  herein  but  are  una¬ 
vailable  in  a  moraic  model  of  phonological  organiza¬ 
tion. 

The  properties  of  syllabification  in  SSM  appear  par¬ 
adoxical.  On  the  one  hand,  it  can  be  demonstrated 
that  syllabification  must  apply  at  the  word-level  in 
order  to  account  for  the  surface  variation  in  vowel 
length.  On  the  other  hand,  in  order  to  explain  the 
distribution  of  epenthesis,  syllabification  must 
arguably  take  place  prior  to  the  word-level.  I 
propose  a  solution  to  this  dilemma  by  arguing  that 
syllabification  in  SSM  takes  place  at  two  discrete 
stages.  CV(X)  syllables  are  constructed  at  the 
lexical  level.  Subsequent  word-level  syllabification 
builds  syllables  from  right-to-left,  respecting  previ¬ 
ously  built  structure  and  Maximality.  I  demonstrate 
that  Prosodic  Syllabification,  as  proposed  in  Ito 
(1986,  1989)  cannot  correctly  derive  the  SSM  facts. 
This  results  from  two  (potentially)  problematic 
aspects  of  SSM  syllabification;  (i)  although 
syllabification  must  not  apply  cyclically,  there  are 
certain  cyclic  effects  observed  and  (ii)  epenthesis 
can  only  occur  at  morpheme  boundaries  even 
though  these  boundaries  are  not  visible  at  the  point 
in  the  phonology  where  syllabification  must  occur. 
These  effects  fall  out  of  the  analysis  that  I  propose 
herein. 


1.2.9  Deletion,  Deaccenting,  and 
Presupposition 

Christopher  D.  Tancredi 

Abstract 

In  this  dissertation,  I  examine  the  effects  of 
deaccenting— the  removal  of  phonological  accent 
from  a  constituent — on  interpretation.  In  general, 
deaccenting  of  an  element  is  possible  only  if  that 
element  is  salient  in  the  discourse  context. 
Salience  alone,  however,  is  not  a  sufficient  condi¬ 
tion  for  deaccenting.  The  discourse  context  in  (1) 
makes  salient  both  the  verb  hit  and  the  NP  John, 
and  it  is  consequently  possible  for  these  elements 
to  be  deaccented  in  (1a,b).  However,  in  (1c),  it  is 
not  possible  to  deaccent  both  of  these  elements 
simultaneously.  (Focus  is  indicated  by  CAPITAL¬ 
IZATION,  deaccenting  by  small  italics.) 

(1)  Mary  hit  John.  Then, 

a.  BILL  hit  SUE. 

b.  BILL  KICKED  Mary. 

c.  #BILL  hit  Mary. 

Deaccenting  of  both  the  verb  hit  and  its  direct 
object  Mary  in  (1c)  requires  that  the  discourse 
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context  make  salient  a  hitting  of  Mary,  but  this  con¬ 
dition  is  not  met  in  (1). 

To  account  tor  the  facts  illustrated  above,  I  propose 
that  deaccenting  plays  a  role  in  identifying  the 
focus-related  topic  of  a  sentence,  where  it  is  a  nec¬ 
essary  condition  for  a  sentence  to  be  felicitous  in  a 
given  context  that  the  focus-related  topic  of  that 
sentence  be  instantiated  in  the  context.  The  focus- 
related  topic  of  a  sentence  is  generated  (roughly) 
by  replacing  all  focused  constituents  by  variables 
and  combining  the  resulting  structure  so  as  to  end 
up  with  the  smallest  structure  within  which  all  prop¬ 
erties  of  the  remaining  lexical  elements  are  satis¬ 
fied.  The  resulting  structure  will  be  instantiated  in  a 
context  if  there  is  another  element  in  the  context 
with  which  it  is  non-distinct,  where  variables  count 
as  non-distinct  from  other  elements  of  the  same 
semantic  type.  By  this  process,  the  sentences  in 

(1)  will  have  the  focus-related  topics  in  (2). 

(2)  a.  X  hit  y 

b.  Mary 

c.  X  hit  Mary 

The  representations  in  (2a,b)  are  instantiated  in  the 
context  consisting  of  the  sentence  Mary  hit  John  in 
(1)  above,  and  hence  (1a,b)  are  felicitous.(2c), 
however,  is  not  instantiated  in  this  context,  and 
hence  is  infelicitous. 


The  analysis  sketched  above  provides  an  explan¬ 
ation  for  the  felicity  or  infelicity  of  a  sentence  in  a 
larger  context  based  upon  the  focus  structure  of  the 
sentence  and  the  composition  of  the  context.  While 
the  analysis  is  of  some  interest  by  itself,  even  more 
important  is  the  use  to  which  the  analysis  can  be 
put  in  accounting  for  certain  phenomena  typically 
associated  with  VP  deletion.  Since  Sag  (1976)  and 
Williams  (1977),  it  has  standardly  been  assumed 
that  restrictions  on  pronominal  interpretations  in  VP 
deletion  contexts  are  to  be  explained  in  terms  of  the 
mechanism  that  assigns  an  interpretation  to  an 
empty  VP.  However,  we  find  that  identical 
restrictions  appear  in  contexts  in  which  a  VP  has 
been  deaccented  but  not  deleted.  Thus,  parallel  to 
the  sentence  in  (3a)  which  has  only  a  strict  and  a 
sloppy  reading  available  for  the  (deleted)  pronoun, 
we  find  an  identical  restriction  for  the  deaccented 
pronoun  in  (3b). 

(3)  a.  John,  said  he,  is  a  genius  because  Bill  did. 

b.  John,  said  he,  is  a  genius  because  Bill  said 
he's  intelligent. 

No  analysis  of  VP  deletion  in  the  current  literature 
is  capable  of  accounting  for  this  parallelism.  If  we 
assume  that  a  phonologically  deleted  VP  is  repres¬ 
ented  in  the  LF  representation  of  a  sentence  as  a 
deaccented  VP,  however,  it  becomes  possible  to 
account  for  this  parallelism  in  a  principled  fashion 
by  assimilating  the  deletion  cases  to  the  deac¬ 
centing  ones. 
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“Doubly-Strained  lnAIAs/n*-lnGaAs  HFETs.” 
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Rita  C.  McKinnon 


Postdoctoral  Fellows  and  Associates 


Tomas  A.  Arias 
Robert  C.  Armstrong 
Alice  M.  Berglund 
Peter  A.  Cariani 
Ibrahim  M.  Elfadel 
Susan  L.  Goldman 
Joseph  A.  Izatt 


Research  Assistants 

Charles  C.  Abnet 
Daniel  P.  Aalberls 
Mazher  M.  Alidina 
Christopher  B.  Anderson 
John  G.  Apostolopoulos 
Mark  A.  Armstrong 
Neer  R.  Asherie 


Wolfgang  Ketterle 
Pauline  M.  McMahon 
Robert  D.  Meade 
Gale  S.  Petrich 
David  P.  Pullman 
Sunil  Puria 

Christine  M.  Rankovic 


David  B.  Askey 
William  W.  Au 
Walter  A.  Aviles 
Ian  M.  Avruch 
Sandeep  Bahi 
Donald  G.  Baltus 
Deborah  J.  Becker 
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Abhay  K.  Ram 
Dennis  D.  Rathman 
John  J.  Rosowski 
Stefanie  Shattuck-Hufnagel 
Linda  E.  Sugiyama 
Mario  A.  Svirsky 
Hong  Z.  Tan 
Ngai  Chuen  Wong 
Y.  Eric  Yang 


William  H.  Smith 
Vicky-Lynn  Taylor 


Gina  L.B.  Milton 
Mark  K.  Mondol 
Donald  K.  North 
Robert  H.  Priest 
B'  jce  A.  Russell 
Maxine  P.  Samuels 
Sally  C.  Santiago 
Robert  D.  Sisson 
Clare  F.  Smith 
Mabayoje  Tuyo 
Arlene  E.  Wint 
Sharon  J.  Worley 
Mary  J.  Ziegler 


Barrett  Rogers 
M.  Selim  Shahriar 
Annie  H.  Takeuchi 
Xin  Xu 

Xinglong  Yan 
Yi  Yuan 


Paul  A.  Belk 
Keren  Bergman 
Vishal  L.  Bhagwati 
Igor  P.  Bilinsky 
Luc  Boivin 
Michael  P.  Bradley 
Louis  R.  Brothers 


Appendix  B.  Current  RLE  Personnel 


Martin  Burkhardt 
Carlos  R.  Cabrera 
Xuejun  Cai 
Claudio  L.  Cesar 
Michael  S.  Chapman 
Grace  H.  Chen 
Jerry  C.  Chen 
Jyh-Shing  Chen 
Tak  K.  Cheng 
Tony  Chiang 
Dmitri  Chkiovskii 
Kyeongjae  Cho 
Woo-Young  Choi 
Mike  T.  Chou 
Jeffrey  A.  Colborn 
Samuel  R.  Conner 
Christopher  A.  Coronado 
Michael  W.  Courtney 
Pierre  Coutu 
Jay  H.  Damask 
Kiran  B.  Dandekar 
Charles  Q.  Davis 
Kendall  Davis 
Anthony  D.  Della  Ratta 
Frank  DiFilippo 
Christopher  R.  Doerr 
David  J.  Dougherty 
Paul  Duchnowski 
Scott  B.  Dynes 
Christopher  R.  Ekstrom 
John  D.  Ellithorpe 
Darin  R.  Ernst 
Hong  T.  Ewe 
Alexis  Falicov 
Paul  W.  Fieguth 
Siegfried  B.  Fleischer 
Andre  B.  Fletcher 
Ethan  B.  Foxman 
Joseph  A.  Frisbie 
Eric  M.  Fuchs 
Alex  W.  Fung 
Swetlana  Gaffron 
Edouard  A.  Garcia 
Boris  Golubovic 
James  G.  Goodberlet 
Mark  R.  Griffith 
Andrew  E.  Grumet 
Nitin  Gupta 


Teaching  Assistants 

John  R.  Buck 
Shiufun  Cheung 
Kevin  G.  Christian 


Rakesh  Gupta 
Troy  D.  Hammond 
Scott  D.  Hector 
Michael  R.  Hee 
Lori  K.  Herold 
Eason  Ho 
Jeffrey  R.  Holley 
William  C.  Hoston 
Chih-Chien  Hsu 
Wen  Hu 
David  Huang 
Charles  T.  Huitgren 
Brian  R.  Jacobson 
Louise  Jandura 
Hong  Jiao 
Michael  A.  Joffe 
Joel  T.  Johnson 
Mark  A.  Johnson 
Mattan  Kamon 
Charles  A.  Katz 
Sumanth  Kaushik 
Farzana  I.  Khatri 
Songmin  Kim 
Hong-Kwang  J.  Kuo 
Bobby  Y.  Lai 
Cheung-Wei  Lam 
Warren  M.  Lam 
Dicky  Lee 
Laurence  H.  Lee 
Zachary  K.  Lee 
Gadi  Lenz 
Stan  Y.  Liao 
Sharlene  A.  Liu 
Kan  Lu 

Robert  I.  Lut  'ak 
Ilya  Lyubomirsky 
Kamal  M.  Malek 
Paul  S.  Martin 
Noel  S.  Massey 
Alexander  Matusis 
Joseph  A.  Maxwell 
Marc  O.  Mewes 
Peter  A.  Monta 
Christopher  B.  Moore 
John  D.  Moores 
Martin  Muendel 
Christian  R.  Musil 
Keith  S.  Nabors 


Kyle  K.  Iwai 
Stephen  F.  Scherock 
Mohammad  A.  Tassoudji 


Malini  V.  Narayanan 
Vasant  Natarajan 
Julien  J.  Nicolas 
James  M.  Njeru 
Lily  Y.  Pang 

Hanalabos  Papadopoulos 
Thomas  J.  Paul 
Gregory  E.  Penn 
Michael  O.  Polley 
Kelly  L.  Poort 
Matthew  H.  Power 
Khalid  Rahmat 
Malini  Ramaswamy 
Monte  J.  Ramstad 
Mark  W.  Reichelt 
Michael  D.  Richard 
Caterina  Riconda 
Michael  J.  Schwartz 
Mark  N.  Seidel 
Lisa  Shatz 
Amelia  H.  Shen 
Gennady  Shvets 
Luis  M.  Silveira 
Andrew  C.  Singer 
Jurgen  H.  Smet 
Stephen  P.  Smith 
Gideon  P.  Stein 
Richard  E.  Stoner 
Ke-Xun  Sun 
Lon  E.  Sunshine 
Kohichi  R.  Tamura 
Guillermo  J.  Tearney 
Farzam  Toudeh-Fallah 
Morrison  Ulman 
Jesus  Noel  Villasenor 
Pavel  S.  Volfbeyn 
Susie  J.  Wee 
Lorin  F.  Wilde 
Timothy  A.  Wilson 
John  W.  Wissinger 
Vincent  V.  Wong 
Albert  R.  Woo 
Rolf  A.  Wyss 
Julius  J.  Yang 
Nathan  Yang 
Kenneth  W.  Yee 
Chang  D.  Yoo 
Kambiz  C.  Zangi 


Murat  E.  Veysoglu 
Kathleen  E.  Wage 
Li-Fang  Wang 
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Graduate  Students 

Laura  E.  Adams 
Aaron  J.  Berg 
Karl  K.  Berggren 
Philippe  Berisset 
Jeffrey  K.  Bounds 
Stanley  W.  Brown 
Palmyra  E.  Catravas 
Hwa-Ping  Chang 
Irfan  U.  Chaudhary 
Marilyn  Y.  Chen 
Yiu  Y.  Chu 
Kevin  M.  Cuomo 
Ali  Darwish 
William  S.  Daughton 
David  M.  DeCaprio 
Joseph  G.  Desloge 
Daniel  J.  DiLorenzo 
Shepard  S.  Doeleman 
Philip  A.  Fisher 
Jun  Funatsu 
Eric  J.  Gaidos 
Renato  Gatto 


Undergraduate  Students 

Tracy  E.  Adams 
Daniel  A.  Alvarez 
Michele  M.  Bierbaum 
Amy  S.  Chang 
Beth  L.  Chen 
Erika  Chuang 
Henry  E.  Chung 
Leanne  C.  Clarke 
John  A.  Crouch 
Gail  Denesvich 
Laura  C.  Dilley 
Jonathan  C.  Doan 
Ilya  Entin 

Mauricio  A.  Escobar 
Darlene  J.  Ford 
Helena  H.  Geng 
Rajashi  Ghosh 
Andrew  C.  Goldish 
Daniel  I.  Goldman 
Matthew  K.  Gray 
Mamie  Harker 
Akikazu  Hashimoto 
Todd  A.  Hay 
David  H.  Hijirida 
Jason  Hintersteiner 
Surendranauth  Hiraman 


David  S.  Gloss 
Julie  E.  Greenberg 
Ibrahim  A.  Hajjahmad 
Katherine  L.  Hall 
Jody  House 
Steven  H.  Isabelle 
Erik  B.  Iverson 
Yasuo  Kamatani 
Chwen-yuen  Ku 
Volkan  C.  Kubali 
Daniel  D.  Lee 
Kevin  Li 
Peggy  Li 
Nicole  Love 
Kent  H.  Lundberg 
Artem  Maloratsky 
Michael  M.  McCue 
Ignacio  S.  McQuirk 
Mark  D.  Messier 
Michael  C.  Moldoveanu 
Ann  W.  Morgenthaler 
Alex  X.  Mou 


Elliot  E.  Hui 
Mohammad  J.  Khan 
Sungtae  Kim 
Teresa  W.  Lau 
Jean  Q.  Lee 
William  K.  Lee 
Julie  C.  Lew 
Huiying  Li 
Kuo-Yi  Lim 
Michael  H.  Lim 
Adam  P.  London 
Henry  Y.  Lu 
David  S.  Lum 
Kristine  W.  Ma 
Robert  J.  Macke 
Mahbubul  A.  Majumdar 
Alexander  R.  Mallin 
Michael  L.  McComas 
Chad  H.  Mikkelson 
Rachel  E.  Molenaar 
Jason  T.  Mueller 
Philip  M.  Nadeau 
Lee  P.  Ng 
Hana  Ohkawa 
Dennis  O.  Ouma 
Charlie  C.  Pan 


Aradhana  Narula 
Lynn  E.  Nelson 
Janet  L.  Pan 
Pamela  M.  Paufler 
Denis  J.  Peregiym 
Aleksandar  Pfajfer 
Joel  R.  Phillips 
Adam  D.  Polcyn 
David  V.  Pynadath 
Todd  H.  Rider 
■^rank  L.  Ross 
Ante  Salcedo 
Jon  C.  Sandberg 
Steven  D.  Schultz 
Hugh  E.  Seeker-Walker 
Barbara  G.  Shinn 
Chin  H.  Tan 
Ricardo  Telichevesky 
Christopher  B.  Umminger 
Luigi  Vacca 
Shawn  M.  Verbout 


Amrit  R.  Pant 
Sang  H.  Park 
Jonathan  D.  Pelly 
Michael  T.  Pierce 
Vlad  Portnoy 
Farhan  Rana 
Kenneth  N.  Ricci 
Brian  A.  Rubin 
Gaeta  H.  Sankappanavar 
Anthony  R.  Scherer 
Jiri  Schindler 
Satyen  Shah 

Christopher  M.  Smithhisler 
Abraham  D.  Stroock 
Jie  Sun 

Prasanna  B.  Tambe 
Bridget  E.  Tannian 
Stanley  H.  Thompson 
Ann  L.  Torres 
Andrew  Ugarov 
Monika  K.  Waiczak 
Janet  Wang 
Joshua  N.  Winn 
John  J.  Wuu 
Ross  A.  Yu 


368  RLE  Progress  Report  Number  1 35 


Appendix  B.  Current  RLE  Personnel 


Visiting  Scientists 

Vladimir  Barsukov 
Corine  A.  Bickley 
Franco  Carpignano 
Guang-Sup  Cho 
Steve  Cohn 
Paolo  Detragiache 
Theodore  W.  Ducas 
Anna  Esposito 
Kaigham  J.  Gabriel 
Artur  D.  Gouveia-Neto 
Rogeve  Gulati 
Christian  V.  Hafner 
Yie  He 


James  J.  Hugunin 
Arthur  K.  Jordan 
Hae  M. Jung 
Stephan  Jungling 
Franz  X.  Kartner 
Nobuyoshi  Koshida 
Jack  Kotik 
Takahide  Matsuoka 
Jose  C.  Monteiro 
Phylis  Morrison 
Thomas  B.  Mueller 
Guilia  A.  Nassi 


Marco  Nassi 
Hamid  Nawab 
Kevin  O’Neill 
Jerome  M.  Paye 
Karen  L.  Payton 
Geoffrey  L.  Plant 
Stanley  J.  Rosenthal 
Makoto  Sakamoto 
Masataka  Shirasaki 
Chi-Kuang  Sun 
George  M.  Svolos 
Yunxian  Tong 


Research  Affiliates 


Mitchell  R.  Balonon-Rosen 

Eva  B.  Holmberg 

Joseph  F.  Rizzo 

John  S.  Barlow 

Caroline  B.  Huang 

Jay  T.  Rubinstein 

Herbert  J.  Bernstein 

Yoshiko  Ito 

Steven  R.  Rueman 

Giuseppe  Berlin 

Joseph  A.  Jarrell 

Madhav  K.  Setty 

Suzanne  Boyce 

John  D.  Kierstead 

Robert  T.  Shin 

Frank  S.  Cardarelli 

Lance  G.  Joneckis 

Richard  J.  Solomon 

H.  Steven  Colburn 

Harlan  Lane 

Elizabeth  E.  Sperry 

Daniel  Ehrlich 

Charles  P.  Lin 

Frank  J.  Stefanov-Wagner 

Carol  Y.  Espy-Wilson 

Neil  A.  Macmillan 

David  A.  Steffens 

Ignacio  Garcia-Otero 

John  1.  Makhoul 

Joseph  Tierney 

Gad  Geiger 

Sharon  Y.  Manuel 

Michael  J.  Tsuk 

Bernard  Gold 

Bruce  R.  Musicus 

Kenneth  P.  Wacks 

Richard  S.  Goldhor 

Mark  R.  Nilsen 

Ehud  Weinstein 

Kenneth  W.  Grant 

Leonard  L.  Picard 

David  R.  Williams 

Robert  D.  Hall 

Soon  Y.  Poh 

Jane  W.  Wozniak 

Philip  R.  Hemmer 

James  C.  Preisig 

Jiqing  Xia 

Robert  E.  Hillman 

Mara  G.  Prentiss 

Marc  Zissman 

Mark  A.  Hollis 

Stephen  A.  Raymond 

Promotions  • 

RLE  is  pleased  to  acknowledge  that  the  faculty 
and  staff  shown  on  this  page  received  pro¬ 
motions  during  1992. 

# 


Mr.  Oizheng  Gu  (Photo  by  John  F.  Cook) 
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Appendix  C.  Milestones 


C.1  New  Faculty  and  Staff 

Dr.  Raymond  C.  Ashoori  joined  RLE's  Quantum- 
Effect  Devices  Group  as  Assistant  Professor  of 
Physics  in  January  1993. 

Mr.  Joseph  F.  Connolly  was  appointed  RLE 
Administrative  Officer  in  October  1992. 

Dr.  David  L.  Ze//zer  joined  RLE's  Sensory  Commu¬ 
nication  Group  in  January  1993. 


C.2  Promotions 

Dr.  Srinivas  Devadas  was  promoted  to  Associate 
Professor  of  Electrical  Engineering  and  Computer 
Science  in  July  1992.  Dr.  Devadas  is  a  member  of 
RLE's  Circuits  and  Systems  Group. 

Mr.  Qizheng  Gu  was  appointed  Research  Scientist 
in  RLE's  Center  for  Elecfomagnetic  Theory  and 
Applications  in  February  1992. 

Dr.  John  J.  Guinan,  Jr.,  Principal  Research  Scien¬ 
tist  in  RLE's  Auditory  Physiology  Group,  was 
appointed  Associate  Director  at  the  Massachusetts 
Eye  and  Ear  Infirmary  in  January  1992. 

Dr.  Leslie  A.  Kolodziejski,  Karl  Van  Tassel  Career 
Development  Assistant  Professor,  was  promoted  to 
Associate  Professor  of  Electrical  Engineering  and 
Computer  Science  in  July  1992 

Dr.  Melanie  L.  Matthies  was  promoted  to 
Research  Scientist  in  RLE's  Speech  Communi¬ 
cation  Group  in  May  1 992. 


C.3  Chair  Appointments 

Dr.  Thomas  F.  Weiss,  Professor  of  Electrical  and 
Bioengineering  and  Acuity  member  in  RLE's 
Auditory  Physiology  Group,  was  appointed  Thomas 
and  Gerd  Perkins  Professor  of  Electrical  Engi¬ 
neering  and  Computer  Science  in  May  1992. 


C.4  Awards  and  Honors 

Dr.  Sylvia  T.  Ceyer,  Professor  ot  Chemistry  and 
faculty  member  in  RLE’s  Surfaces  and  Interfaces 
Group,  was  elected  fellow  of  the  American 
Academy  of  Ads  and  Sciences  in  April  1992. 

Dr.  Srinivas  Devadas,  Associate  Professor  of  Elec¬ 
trical  Engineering  and  Computer  Science  and  a 
member  of  RLE’s  Circuits  and  Systems  Group, 
received  a  1992  Young  Investigator  Award  from  the 
National  Science  Foundation. 

Dr.  Daniel  Kleppner,  Associate  Director  of  RLE 
and  Lester  Wolfe  Professor  of  Physics,  was  elected 
fellow  of  the  Optical  Society  of  America  in  May 
1992.  Professor  Kleppner  is  also  a  faculty  member 
in  RLE's  Atomic,  Molecular,  and  Optical  Physics 
Group. 

Dr.  Joseph  S.  Perkell,  Senior  Research  Scientist 
in  the  Speech  Communication  Group,  was  elected 
fellow  of  the  Acoustical  Society  of  America  in  May 
1992. 

Dr.  William  F.  Schreiber,  Professor  Emeritus  of 
Electrical  Engineering  and  Computer  Science,  was 
awarded  the  1992  Journal  Award  of  the  Society  of 
Motion  Picture  and  Television  Engineers  (SMPTE). 
This  was  the  fourth  time  Professor  Schreiber 
received  the  Journal  Award. 

Dr.  Jerome  B.  Wiesner,  President  Emeritus  of  MIT 
and  Director  Emeritus  of  RLE,  received  the  National 
Science  Board’s  Vannevar  Bush  Bush  Award  in 
April  1992. 
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Appendix  D.  RLE  Research  Support  Index 


3M  Company  26-29 
Ampex  263-267 

Advanced  Television  Research  Program  263-267 
American  Broadcasting  Corporation  263-267 
Analog  Devices,  Inc.  227-229 
AT&T  Bell  Laboratories  17,  258-259 
Bose  Corporation  90-94 
Consortium  for  Superconducting  Electronics 
63-64,  209-210 

Defense  Advanced  Research  Projects  Agency 
(DARPA)  7-8,  10-13,  17,  26-33,  47-50,  53-54, 
63-66,  112-113,  209-210,  242-245,  247-252, 
256-262 

Dennis  Klatt  Memorial  Fund  282-289 
Digital  Equipment  Corporation  205-206,  241-242, 
282-289 

Draper  (Charles  S.)  Laboratory  10-11,  19-23,  26, 
72-74,  105-106,  151-153 
Federal  Bureau  of  Investigation  240-241 
Friedman  (F.L.)  Chair  171-172 
Fujitsu  Laboratories  19-23,  73-74 
Future  Energy  Applied  Technology,  Inc.  100-102 
General  Instrument  263-267 
GTE  Laboratories  12 
Hertz  Foundation  11-14 
Houston  Advanced  Research  Center  275-276 
IBM  Corporation  54-56,  227-229,  238-245 
Joint  Services  Electronics  Program  (JSEP) 

7-10,  19-23,  26-30,  39-48,  50-56,  61-62,  65, 
71-72,  75-85,  119-127,  131-140,  155-157, 
161-169,  205-206 
Kodak  Corporation  263-267 
Lawrence  Livermore  National  Laboratories 
90-94,  181-184 

LeBel  (C.J.)  Foundation  282-288 
Lockheed  Sanders,  Inc.  257-259,  261 
Maryland  Procurement  Office  103-105 
Micrion  34 

MIT  Leaders  for  Manufacturing  Program 
219-221 

MIT  Lincoln  Laboratory  72,  94-98,  109-110, 
218-219 


MIT-Woods  Hole  Oceanographic  Institution 
Joint  Program  256-257,  259-260 
MOSIS  12-13 

Motorola  Corporation  263-267 
NCIPT  7-8,  10-13,  17 

National  Aeronautics  and  Space  Administration 
65-66,  109-110,  181,  184-186,  189-191, 

203-205,  216-219 

National  Center  for  Integrated  Photonics  74-77 

National  Institute  of  Standards  and  Technology 
105-106 

National  Institutes  of  Health  85-90,  282-289, 
293-306,  309-311,  319-327 

National  Science  Foundation  8-9,  12-17,  26-29, 
33-34,  41-44,  47-53,  61,  72-77,  79-85,  110-112, 
128-129,  155-163,  175-186,  189,  209,  215-219, 
229-242,  260,  262,  282-289 
Northeastern  University  1 6 
Packard  (David  and  Lucille)  Foundation  217-218 
Public  Broadcasting  System  263-267 
Rockwell  International  Corporation  97-98 
Semiconductor  Research  Corporation  33-34 
SM  Systems  and  Research,  Inc.  218-219 
Space  Exploration  Associates  100-102 
Tektronix  Corporation  263-267 
Texas  Instruments,  Inc.  14,  19-23 
Toshiba  Corporation,  Ltd.  8-9 
TRW  Systems  16-17 

U.S.  Air  Force  -  Electronic  Systems  Division 
145-151 

U.S.  Air  Force  -  Office  of  Scientific  Research 
57-62,  71-72,  75-85,  107,  175-181, 

257-262,  306-307 

U.S.  Army  -  Corps  of  Engineers  203-205 
U.S.  Army  -  Harry  Diamond  Laboratories  175-181 
U.S.  Army  Research  Office  14-16,  31-34,  50-52, 
62-63,  105-106,  114-116,  163-166 
U.S.  Department  of  Energy  90-94,  129-130,  175-181, 
184-189,  192-199 

U.S.  Department  of  Transportation  207 
U.S.  Navy  -  Office  of  Naval  Research  34,  73-74, 
79-90,  106-107,  148-149,  158-161,  163-169, 
175-181,  203-206,  238-245,  247-252,  256-262, 
267-268,  307-309,  311-315 

Vitesse  Semiconductor  11-12 
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Project  Staff  and  Subject  Index 
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Auditory  system  319 — 327 
Avruch,  Ian  M.  215,216 


Project  Staff  and  Subject  Index 

A 


Aalberts,  Daniel  P.  119,  120 
Abernathy,  Douglas  L.  137 
Abnet,  Charles  C.  319,  321 
Acioli,  Lucio  H.  71,79,81 
Acoustic  Thermometry  of  Ocean  Climate  (ATOC) 
Program  256 
Adams,  Laura  E.  71 
Adams,  Tracy  E.  281 

Advanced  Microwave  Sounding  Unit  218,  219 

Advanced  Television  Research  Program  263 — 267 

Advanced  X-ray  Astrophysics  Facility  65 

Aggarwal,  Rajni  J.  7,  14 

Air  traffic  control  systems  207 

AlcatorC-MOD  192 

Aldridge,  Mary  C.  71 

Ali,  Sami  M.  203,  209—212,  227 

Aliberti,  Giovanni  255 

Alidina,  Mazhar  M.  227,  242 

Alkhairy,  Ashraf  S.  215,  220 

Alien,  Jonathan  227 — 245,  281 

Altshuler,  Boris  L.  39 — 40 

Alvarez,  Daniel  A.  293 

Analog  processing 

Wavefront  correction  107 
Ananthapadmanabha,  Tirupattur  V.  281 
Ananthraman,  Santosh  293 
Annaswamy,  Anuradha  31 1 
Antoniadis,  Dimitri  A.  31,  32,  33,  54,  57,  58,  59, 

61,  227,  239 

Apostolopoulos,  John  G.  263,  264 
Arias,  Tomas  A.  131 
Ariel,  Imadiel  39,45,131 
Armstrong,  Mark  A.  31 ,  33 
Armstrong,  Robert  C.  227,  229 
Arnold,  David  V.  203,  206 
Array  processing  256 
Articulation  index  294 
Asherie,  Neer  R.  1 75,  1 92 
Ashoori,  Raymond  C.  371 
AT&T  Bell  Laboratories  269 
Atkins,  Robert  G.  203,  206 
Atom-beam  interferometry  47 
Atomic  physics  155 — 169 
Atoms 

Diffraction  of  163 

Structure  in  magnetic  fields  155, 158 

Trapping  and  cooling  166 
Au,  William  W.  203 
Aucoin,  Richard  J.  47,  66 


B 

Bach,  Susan  E.  293 
Baggeroer,  Arthur  B.  255,  256 
Bahl,  Sandeep  R.  19 
Bai,  Kelly  S.  227,242 
Baltus,  Donald  G.  227,  228 
Balzer,  Janice  L  319 
Barrett,  John  W.  215,  219 
Barsukov,  Vladimir  M.  281 
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